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Recent studies have shown that the number counts of convergence peaks N(k) in weak lensing 
(WL) maps, expected from large forthcoming surveys, can be a useful probe of cosmology. We 
follow up on this finding, and use a suite of WL convergence maps, obtained from ray-tracing 
N-body simulations, to study (i) the physical origin of WL peaks with different heights, and (ii) 
whether the peaks contain information beyond the convergence power spectrum Pi. In agreement 
with earlier work, we find that high peaks (with amplitudes ;> 3.5<r, where a is the r.m.s. of the 
convergence k) are typically dominated by a single massive halo. In contrast, medium-height peaks 
(« 0.5 — 1.5a) cannot be attributed to a single collapsed dark matter halo, and are instead created 
by the projection of multiple (typically, 4-8) halos along the line of sight, and by random galaxy 
shape noise. Nevertheless, these peaks dominate the sensitivity to the cosmological parameters 
w,a$, and fi m . We find that the peak height distribution and its dependence on cosmology differ 
significantly from predictions in a Gaussian random field. We directly compute the marginalized 
errors on w, as, and Sl m from the N(k) + Pi combination, including redshift tomography with source 
galaxies at z s = 1 and z s = 2. We find that the N{k) + Pi combination has approximately twice 
the cosmological sensitivity compared to Pi alone. These results demonstrate that N(k) contains 
non-Gaussian information complementary to the power spectrum. 

PACS numbers: PACS codes: 98.80.-k, 95.36. +x, 98.65. Cw, 95.80.+P 



I. INTRODUCTION 

Weak gravitational lensing (WL) by large-scale cos- 
mic structures has emerged as one of the most promising 
methods to constrain the parameters of both dark energy 
(DE) and dark matter (DM) (e.g. ref. pQ; see also recent 
reviews in refs. [2j [3]). While linear and mildly nonlin- 
ear features in WL maps have been thoroughly explored, 
an important question that remains is: how much addi- 
tional information lies in the nonlinear features of these 
maps? Motivated by this, we recently investigated a sim- 
ple nonlinear statistic - counting peaks in WL maps di- 
rectly as a function of their height and angular size ([!]; 
hereafter Paper I). This statistic does not lend itself to 
straightforward mathematical analysis - it requires nu- 
merical simulations and has received relatively little at- 
tention (e.g. 0E]) until recent simulation work |U[7]. 

In Paper I, we identified peaks in ray-tracing N-body 
simulations, defined as local maxima in two-dimensional 
convergence maps. We found that the number of peaks 
as a function of their height K pca k has a sensitivity to 
a combination of (u>, ag) competitive with other forth- 
coming cosmological probes. Dietrich and Hartlap [7] in- 
vestigated peak counts as a function of 17 m and as, and 
reached qualitatively similar conclusions. 

One result identified in our study is that the cosmo- 



logical sensitivity arises primarily from medium-height 
peaks, with amplitudes of ~ 0.5 — 1.5a, where a is the 
r.m.s. of the WL convergence k. Dietrich and Hartlap 
include only higher significance peaks in their analysis 
(with >2.2a) 1 but they find a similar trend, namely 
that most of the cosmological information is contained in 
the lowest significance peaks. In Paper I, we also found 
that, somewhat counter-intuitively, the number of the 
medium-height peaks decreases with increasing erg." 

Motivated by these findings, here we attempt to clar- 
ify the physical origin of the medium amplitude peaks, 
by identifying collapsed dark matter halos along sight - 
lines to individual peaks. The fact that the cosmological 
sensitivity is driven by relatively low-amplitude peaks 
raises a potential concern: these peaks may be domi- 
nated by galaxy shape noise, and/or may arise from ran- 
dom projections of large-scale overdensities in the mildly 
nonlinear regime. The counts of the medium peaks may 
then offer little information beyond conventional statis- 
tics, such as the power spectrum. Our second aim in this 



Note that Dietrich and Hartlap refer to these as peaks with a 
signal-to-noise ratio of J> 3.2. This is because our definition of 
<t includes both shape noise and the cosmological large-scale- 
structure signal; these are comparable (see § III A| below). 
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paper is therefore to investigate the origin of the cosmo- 
logical information content of the WL peaks. To this 
end, we compare peak-height distributions in different 
cosmologies with those expected in corresponding Gaus- 
sian random fields (GRFs). Further improving on Paper 
I, we are able to provide marginalized constraints from 
the combination of peak counts and the power spectrum 
for the cosmological parameters erg, w, and Q m , not just 
parameter sensitivity, since we ran a much larger set of 
simulations. 

This paper is organized as follows. In § II, we describe 
our calculational procedures, including the creation of 
the WL maps, the identification of collapsed halos, the 
prediction of peak counts in a GRF, and our statistical 
methodology to compare maps. In § III, we present our 
results, which include the matching of peaks and halos, 
matching peaks in different cosmologies, and the com- 
parisons of the simulated peak counts to the Gaussian 
predictions. In § IV, we offer a detailed discussion of 
our main results, as well as of several possible caveats 
and extensions. Finally, in § V, we summarize our main 
conclusions and the implications of this work. 



II. METHODOLOGY 

A. Simulating Weak Lensing Maps 

We generate a series of 80 cold dark matter N-body 
simulations for 7 different cosmological models with the 
code GADGET-2, which include DM only (no baryons). 
As our fiducial model, we adopt a ACDM universe with 
the following parameters: cosmological constant JIa = 
0.74, matter density parameter fl m = 0.26, Hubble con- 
stant H = 72 km s _1 Mpc _1 , dark energy equation-of- 
state parameter w — — 1, and a primordial matter power 
spectrum with a spectral index of n s = 0.96 and present- 
day normalization of erg = 0.798. These values are consis- 
tent with the seven-year results by the WMAP satellite 

All simulations use 512 3 DM particles, in a box with a 
size 2 of 240/i _1 Mpc. This corresponds to a mass resolu- 
tion of 7.4 x 10 9 h~ 1 MQ. The output of each simulation 
run consists of snapshots of particle positions at various 
rcdshifts between z — and z — 2, with output red- 
shifts chosen to span intervals of 80/i _1 Mpc along the 
line of sight (LOS) in the fiducial model. 3 This interval 
is shorter, by a factor of three, than our box size; we trun- 
cate the cubes along the LOS to remove the overlap. We 
apply random shifts and rotations to each snapshot cube, 
and create gravitational potential planes at each output 



2 Unless stated otherwise, all quantities in this paper are quoted 
in comoving units. 

3 In cosmologies with different distances, the same redshift is cho- 
sen. 



by projecting the particle density onto a 2D plane per- 
pendicular to the line of sight, located at the output red- 
shift, and solving the Poisson equation. We swap planes 
from several independent simulations for the same cos- 
mology in creating the light cone, to reduce the reuse of 
the same simulation box and to make the final WL maps 
more pseudo-independent. We then follow 2048 x 2048 
light rays, starting from z = 0, and calculate the distor- 
tion tensor and lensing deflection angles at each plane, 
and produce the final convergence maps. 

The interested reader is referred to Paper I for more 
details about the simulations and the process of mak- 
ing the maps. We made one important change, however, 
which must be high-lighted. While in Paper I, the den- 
sity and potential planes had a resolution of 2048 x 2048 
(same as the resolution used for ray-tracing and the fi- 
nal convergence map), here we adopted a higher resolu- 
tion, 4096 x 4096, for both the density and the potential 
planes. This change has been proven to be important as 
lower resolution yields a loss in power at large wave num- 
ber, as demonstrated by [S] ■ In Fig. [TJ we compare the 
2D angular power spectrum of the WL convergence field 
at 4096 2 resolution with the theoretical power spectrum. 
The latter was obtained by direct line-of-sight integra- 
tion, using the Limber approximation, and the fitting 
formulae for the nonlinear 3D matter power spectrum 
from [10] . calculated with the code Nicaea 4 [TT] . In the 
range 400 <£ <30, 000, the power spectrum derived from 
our maps is not significantly suppressed by either the fi- 
nite box-size or resolution, and it agrees well with the 
theoretical expectation. 

Once the maps have been created, we take the redshift- 
dependent r.m.s. of the noise in one component of the 
shear to be [T3] 

cr A (z) = 0.15 + 0.035z. (1) 

Note that this corresponds, in the weak lensing limit, to 
an r.m.s. ellipticity of 0.30 + 0.07 z [j"4] . 

For simplicity, in our analysis we assume that the 
source galaxies are located on a source plane at a fixed 
redshift, with n ga i = 15 arcmin~ 2 . We apply a Qq — 
1 arcmin Gaussian smoothing to the maps. The noise 
cr 2 oise in the convergence after the Gaussian smoothing 
then becomes 

2 = (<rj) r9 s 

°noisc o al ' \ > 

For reference, we note that at redshift z s = 2, the above 
gives (Tnoiso = 0.023, very close to the r.m.s. of the con- 
vergence <r K = 0.022 in the noise-free maps. The r.m.s. 
of the total convergence field, with noise included, is 
a K = 0.031. 



4 Available at www2.iap.fr/users/kilbinge/nicaea 
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FIG. 1: Angular power spectrum of the WL convergence as 
a function of spherical harmonic index £. The solid curve is 
the average over 1000 simulated 12- square- degree convergence 
maps from ray-tracing through 45 independent N-body simu- 
lations for the fiducial cosmology. The error bars indicate the 
variance between the maps in the bins plotted. The dashed 
curve is the theoretical prediction, based on the 3D nonlin- 
ear matter power spectrum 11 Of . with the Limber approxima- 
tion f 10/ . Source galaxies are assumed to be located at z s — 2. 
No intrinsic ellipticity noise or smoothing were added. 
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TABLE I: Cosmological parameters varied in each model. The 
universe is always assumed to be spatially flat (T2a + fim = !)■ 



C. Halo Finding 

We use the publicly available AMIGA halo finder ([15]: 
hereafter AHF) to identify collapsed halos in our N-body 
runs. AMIGA finds halos based on an iterative density 
refinement scheme. Its output consists of the 3D posi- 
tions of the halos, and, importantly for us, the tagged 
set of particles belonging to each halo. The virial radius 
of a halo is such that when a sphere is placed at the halo's 
location, with a radius r v j r , the overdensity p{r v - K ) is 



= A, 



tPb, 



(3) 



where pb is the background baryon density, and where 
Avir = 180 is adopted in this study. The mass of the 
halo is then simply given by 



M vir = 47rp 6 A V i r r-^ r /3. 



(4) 



B. Suite of Simulated Maps 



In addition to the fiducial cosmology, we have run N- 
body simulations in six other models. Each of these dif- 
fers from the fiducial model in a single parameter: we 
vary as, w, fl m in both directions, with values erg = 0.750 
and 0.850; w = -0.8 and -1.2; and fl m = 0.23 and 0.29. 
We assume the universe always stays spatially flat (i.e. 
Qj\ + fi TO = 1). The seven different cosmologies will here- 
after be referred to as the fiducial, high-ers, low-as, high- 
ly, law-w, high-fi m , and low-r2 m models, respectively, as 
summarized in Table [I] In each of these N-body runs, 
we create 1000 different WL maps with source galaxies at 
z s = 1, and another 1000 maps with galaxies at z s = 2. 
Each map covers a solid angle of 3.46 x 3.46 degrees. All 
maps were created by mixing potential planes randomly 
among five different N-body runs, with independent real- 
izations of the initial conditions, in the given cosmology. 
Finally, we created an additional 1000 control maps, us- 
ing the planes from 45 additional independent N-body 
runs in the fiducial model. Having 9 times more strictly 
independent realizations allows us to compute the covari- 
ance matrix more accurately (needed for computing A% 2 ; 
see below), and to check the robustness of our results to 
different realizations of the fiducial model. 



As a simple test of both our N-body simulations and 
our implementation of the halo finder, we reproduce the 
fitting formula (their equation B3.) for the halo mass 
function reported by Jenkins et al. |16) . An example of 
this comparison is shown, in our fiducial model at z = 
0, in Fig. [2] Overall, we find excellent agreement, with 
an accuracy of 25% or better up to a halo masses of 
2 x 1O 14 M . For larger masses, there is a large scatter. 

One of the goals in this study is to identify halos con- 
tributing to each individual convergence peak. To do 
this, starting from each peak, we follow the light ray, 
and record the information (masses and location) of the 
halos found within a light cone centered on the peak, 
with a radius of 3 arcmin. This radius is chosen to be 
three times the smoothing scale. We have verified that 
doubling the radius of the light cone does not change our 
halo matching results below - in the sense that no addi- 
tional halos are identified that contribute significantly to 
the total convergence of a peak (see detailed discussion 
below). To be consistent with the perpendicular projec- 
tion of the particle density in each simulation snapshot, 
the light rays consist of a series of parallel line segments, 
which are perpendicular to the potential planes. The co- 
ordinates where the segments cross the potential planes 
are determined by the lensing deflection angle, computed 
and stored during the ray-tracing analysis. For the same 
reason, the light cones are composed of a series of paral- 
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FIG. 2: Halo mass function, produced with the AMIGA halo 
finder [151, in our N-body simulation of the fiducial model at 
z = 0, compared to the fitting formula from Jenkins et al. 



lei cylinders with a radius of 3 arcmin, centered on the 
corresponding light ray. 

As explained above, we truncate the simulation snap- 
shots, in order to generate more independent realizations 
of maps. As a result, occasionally, parts of halos that 
happen to be located near the plane of the truncation 
can be unphysically "cropped" . These cropped halos be- 
come important only when they are sufficiently massive 
to contribute to the convergence of a peak, and when 
they are located near (within a fraction of their virial ra- 
dius) one of the two truncation planes (either in the front 
or the back). Given that halo virial radii are of order 
~ l/i^Mpc, and our truncated box size is 80h 1 Mpc, 
the probability that the latter condition is satisfied is 
~(l + l)/80~ 2.5%. We therefore simply restrict our 
halo catalog only to halos that do not touch the edges. 
We have checked that neglecting the cropped halos does 
not significantly affect our results. 

Our simulations resolve the density structure of indi- 
vidual halos with masses of M >few x I0 11 M Q . How- 
ever, each map contains on the order of I0 3 peaks, and 
more than a dozen halos can contribute to the total con- 
vergence of a single peak; therefore, computing the exact 
contribution of each halo to each peak is computationally 
impractical. Instead, we replace each halo by a spheri- 
cally symmetric NFW [T7] halo with the same virial mass 
Af v j r . For a given impact parameter d (defined as the an- 
gular distance between the halo center and the point of 
closest approach of the light ray corresponding to the 
peak), redshift z, and mass M v - a , the contribution of the 
off-center halo to the convergence peak can then be cal- 
culated analytically. 

More specifically, the density profile is assumed to fol- 
low 



where r is the radius from the halo center, and r s and 
p s are a characteristic radius and density. The profile 
is truncated at r2oo, inside which the mean overdensity 
with respect to the critical density of the universe at red- 
shift z is 200. We adopt the concentration parameter 
c n fw = ?*200 As = 5 m this paper. The convergence due 
to the halo, given an extended redshift distribution of the 
background galaxies, is then given by 

... 4t:G S(0)x, C dz'{dn/dz'){l - Xz / Xz .) 
K W = Ti , \ -( 6 ) 



(1 



n-tot 



Here X(0) is the projected surface density of the halo 
(given explicitly in ref. P2]; see their equations 26-27), 
Xz is the comoving distance to redshift z, dn/dz is the 
surface number density of background galaxies per unit 
redshift, and n to t is the mean total surface density. The 
latter is taken to be rit t5{z — z s ) in this paper, with z s as 
the source redshift. Finally, we use a Gaussian window 
function to smooth the convergence induced by the halo, 
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where the center of the smoothing kernel (0 = 0) is set 
to the angular position of the light ray corresponding to 
a peak, and <po is the angle toward the halo center. The 
smoothing scale 6q is chosen to be 1 arcmin, as in the 
simulated maps. 

To check the accuracy of the NFW approximation for 
the convergence, we selected 81 halos with masses in the 
range 1.5 x 10 12 M < M < 1.5 X 10 14 M in one of 
the realizations of our fiducial model. For each halo, we 
record the value of the convergence k in the map (with 
sources galaxies at z s — 1), in the pixel located in the di- 
rection toward the halo center. We then remove the halo 
from the simulation box, and repeat the ray-tracing pro- 
cedure discussed above, to compute a new value kq at the 
same position, but without the halo. In Fig. |3j we show 
the actual difference Ak = k — kq, against the value K n fw 
expected based on the NFW halo model. As the figure 
shows, the NFW assumption works accurately, although 
it results in a slight underestimate of the convergence. 
The points in the figure yield an average fractional bias 
°f (( K nfw — Ak)/Ak) = —0.067 and an r.m.s. scatter of 

((K„ fw - A K ) 2 /AK 2 y/ 2 =0.15. 



D. Gaussian Random Field Predictions 

One of the key questions to be answered in this pa- 
per, is the extent to which the peak counts contain infor- 
mation beyond traditional measures, such as the power 
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FIG. 3: Comparison of the convergence ft n fw, produced by ha- 
los with an NFW profile, and the difference —Ak in the sim- 
ulated convergence map induced by artificially removing the 
halo from the 3D simulation box. 81 halos, identified in a 
z s — 1 map generated in our fiducial model, were used for 
this exercise. No galaxy noise was added to the maps. The 
NFW assumption works well, with a fractional bias of only 
-6.7 percent, and a scatter of 15 percent, relative to Ak. 



spectrum. For example, if the peaks were produced only 
by a combination of pure galaxy shape noise (which is 
Gaussian by assumption) and linear fluctuations in the 
matter density, then their statistics would be fully de- 
scribed by a Gaussian random field (GRF). The majority 
of the high peaks are known to be associated with col- 
lapsed, nonlinear objects, and their statistics will clearly 
be non-Gaussian. However, the extent to which lower- 
amplitude peaks are non-Gaussian is not clear ab-initio. 
As mentioned in the Introduction, these peaks contain 
most of the cosmological information, and therefore the 
departure in the statistics of these peaks from Gaussian 
predictions is important to understand and quantify. 

As a simple test, we directly compare our simulated 
peak counts with those expected in a GRF. Fortunately, 
the peak counts in a two-dimensional GRF, and their dis- 
tribution in height, are predictable analytically [15]. In 
fact, they depend only on the first and second derivatives 
of the correlation function on small scales (or, equiva- 
lently, the first two moments of the power spectrum). 
For completeness, we reproduce the relevant equations 
here. The differential number of maxima per unit solid 
angle, n max (^), with height in the range v to v + dv, 
where v is measured in units of the standard deviation 
(To of the random field, is given by 



1 



dp 



n m ^{v)dv = — exp(-i/72) /2 G( 7 , 7 »/)(10) 
where 

G( 7 ,z*) = (^- 7 2 )[l-^erfc{ [2(i ^ 2)]1/2 }} 
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FIG. 4: Number of convergence peaks in a Gaussian random 
field, as a function of their height measured in units of the 
standard deviation of the convergence, a§. The data points 
show the number of peaks in bins of width An — 0.25cto, ob- 
tained by averaging counts in 200 random Monte Carlo real- 
izations of a 2d GRF. The input power spectrum was calcu- 
lated from the non-linear matter power spectrum in Smith et 
al. jioy in our fiducial cosmology, with source redshift z s — 2, 
including galaxy noise and smoothing. The map size, after 
excluding 40 pixels along each edge, is 2.88 x 2.88 deg 2 . 
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and where Pg is the continuous 2d power spectrum of 
the convergence field, and £(6) is its two-point correla- 
tion function. Integrating Eq. ( 10 1 over v gives the total 
number of peaks n p k regardless of their height, 



(15) 
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To verify the accuracy of these analytic formulae, we 
produced 200 numerical maps of GRFs, by generating 
200 independent random realizations of the theoretical 
2d weak lensing power spectrum (in our fiducial model, 
assuming a source redshift z s — 2). We first generate a 2d 
complex random field in t space, with the real and imag- 
inary parts of Fourier modes distributed independently, 
following Gaussians with a standard deviation of •v/P^/2. 
Here Pi is the power spectrum [50]. We then perform a 
discrete Fourier transform to produce maps in real space. 
The GRF maps have a size of 2048 x 2048, to mimic 
the actual WL maps. Noise is then added according to 
Eqs. and ([2]), with n ga i=15, and a 1 arcmin smooth- 
ing is applied. To avoid edge effects, we discard pixels 
located within 40 pixels (sj 3x the smoothing scale) of 
the map edges. 

To reproduce the random GRF realizations as closely 
as possible, we do not calculate a®, o\, <y% from the input 
power spectrum; instead, we measure these directly from 
the maps, using the finite difference derivatives (ignoring 
the constant coefficients in Eq. ( 14 ) ) 
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E. Statistical Comparisons 

The basic statistical task in this paper is to assign a sig- 
nificance of the difference between a pair of maps, given 
the stochastic fluctuations in the maps over many real- 
izations. This is required in order to quantify how well 
two cosmological models can be distinguished with the 
peak counts. 

The simplest statistical test consists of computing 
&Xf' f between a pair of cosmologies / and /', using 
the mean number of peaks in each bin, averaged over all 
realizations, 

A X 2 f ,j = dN^^C^rMN^) 

= ^2dN^'' f \c^dNf' f \ (19) 
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where dN> J - J ' = N. 
the average number of peaks in bin i in cosmology /' 
and in cosmology /. Note that i here can label bins 
of different peak-heights, but can also include different 
source galaxy redshifts or smoothing scales. Here 
denotes the covariance matrix of the number of peaks in 
cosmology /, 
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(20) 




(18) 



and 



where re is the 2d convergence field, and 

arc the averages of the corresponding first and sec- 
ond derivatives. For reference, this leads to a prediction 
of 1680 peaks, compared to the actual number 1685 found 
in the noiseless maps; the prediction based on the ana- 
lytic calculation of the cr's is slightly worse, 1649. Wc 
have checked that the situation is similar in the true WL 
maps: measuring 170,1,2 numerically gives a slightly more 
accurate prediction for the total number of peaks than 
calculating 0,1.2 analytically from the power spectrum 
through Eq. Q. 

In Fig. [4j we show predictions from the analytic for- 
mulae (Eqs. (fl0|)-( 14 )), and the mean peak counts in our 
200 mock GRF maps. This tests our reproduction of 
the formulae, as well as the accuracy of our numerical 
measurements of cto, <ti, 02- The agreement is excellent, 
with residuals of only < 2 percent over most of the range 
shown. 

The main advantage of creating mock numerical real- 
izations of the GRF maps is that we can use these to 
measure the (co)variance in the Gaussian peak counts. 
This covariance matrix is necessary to compute the A% 2 
values between pairs of cosmologies in the Gaussian case 
(see discussion below). 



where N£ ' is the number of peaks in bin i in the r 
realization (i.e. convergence map) of the cosmology /, 
and R is the total number of realizations. 

This Ax}' f could be interpreted directly as a likeli- 
hood or confidence level only if (i) the peak count dis- 
tribution in each bin were Gaussian, and (ii) the mean 
peak counts depended linearly on the cosmological pa- 
rameters. As long as the change in parameters is small, 
a Taylor expansion to the first order is a good approx- 
imation, and therefore the second condition is unlikely 
to be strongly violated in our case. We will verify below 
that condition (i) is satisfied to a good accuracy, as well. 
To be specific, we examine directly the distribution of 
the quantity defined as 



x}-{r) 



E^f' ;r) (c (/) ) 

ij 



(21) 



where dN^'' r) = A^ (/ ' ;r) 



f (f) 



N] J ' is the difference between 
the number of peaks in bin i in realization r of a test 
cosmology /', and the average number of peaks in the 
same bin in the fiducial cosmology /. We will show that 
Xf( r ) closely follows a true chi-squared distribution. 

Unless stated otherwise, in this paper, we use five re 
bins to calculate A^ 2 - The bin boundaries are chosen by 
visual inspection, using two rough criteria: (i) the differ- 
ence in the peak height distributions in the two cosmolo- 
gies should not change sign within any of the bins, and 



7 



(ii) the numbers of peaks in each bin should be as com- 
parable as possible. The influence of the number of bins 
and the bin boundaries on our results will be discussed 
in SIIVI below. 



F. Fisher Matrix and Marginalized Error 

With the assumptions that (i) the observables, i.e. the 
mean number of peaks in each bin iVj, depend linearly on 
the cosmological parameters p; and (ii) that for a fixed p, 
the probability distribution of iVj follows a Gaussian, the 
marginalized error on each parameter can be calculated 
using the Fisher matrix (e.g. ref. [21]). 

The Fisher matrix for point p in the parameter space 
is given by the matrix trace 



1 



a/3 



TriC'-'C.^C-'C^ + C^Map], (22) 



with 



M, 



a/3 



N, a N T p+N,pN T a , 



(23) 



where the Greek indices refer to model parameters, a 
comma preceding an index denotes a partial derivative 
with respect to the corresponding parameter, is the 
covariance matrix of iVj's, and iVj is the expectation 
value of N t . In this paper, we will consider only the 



second term in Eq. (22) above. The constraints through 



this term arise from the dependence of the mean num- 
ber of peaks Ni on the cosmological parameters. In 
principle, additional constraints could be available from 
the first term, which represents the dependence of the 
(co)variances C%j = (JVjJVj) on cosmology (see refs. [22] 
and [23] for related points in the context of cluster 
counts). Our results suggest that this dependence is rel- 
atively weak; in practice, however, we do not have a suf- 
ficient number of independent realizations to accurately 
evaluate this dependence. From the Fisher matrix, the 
marginalized error on the parameter a is calculated sim- 
ply as a a — {F~ l ) X Ja . (In contrast, the square root of 
Eq. ( 19 ) divided by the difference between model param- 



eters corresponds to the parameter sensitivity with all 
other parameters fixed.) 



III. RESULTS 

A. The Physical Origin of Peaks and of their 
Cosmology Dependence 

In general, peaks in the convergence field can arise 
for different reasons. They could be caused by (i) one 
or more collapsed halos along the LOS; (ii) large-scale, 
mildly overdense filaments, seen in projection; (iii) non- 
linear "protoclusters" that are on their way to collapse, 
but have not yet virialized and settled into an equilib- 
rium structure |24U25| ; and (iv) pure galaxy shape noise. 



In reality, peaks can be produced by a combination of 
the above effects. 

Relatively high-amplitude ( >3.5<r) lensing peaks have 
been studied thoroughly in the past (e.g. [2"6H28j). and 
it is known that a large fraction of these peaks is at- 
tributable to a single collapsed massive halo. Our mo- 
tivation to revisit this topic is that the cosmological in- 
formation is contained primarily in the lower-amplitude 
peaks, whose origin has not yet been clarified. As de- 
scribed in section II, we start with each individual peak, 
and identify all halos along the sightline. Likewise, for 
each halo, we identify peaks that are located within a 3 
arcmin distance from the 2d sky position of the halo. 

We would also like to know why the number of peaks 
changes with cosmology. To help clarify this, we examine 
realizations of pairs of models with different values of 
as with quasi identical initial conditions. In these pairs 
of models, we use the same random seeds to generate 
the amplitudes and phases for the Fourier modes of the 
density and velocity field at redshift z = 0, and then scale 
these back to the starting redshift of the simulation, with 
the linear growth factor. Since the power spectra differ 
only by an overall normalization, and since the growth 
factor is independent of og, the initial conditions, as well 
as the final WL maps, maintain very similar patterns. 
Therefore we can attempt to match individual peaks in 
the two cosmologies, to see what happens to a given peak, 
when, say, Og is increased. 

Results in this section are based on 50 realizations of 
the noisy maps in the fiducial and in the low-us mod- 
els, with source galaxy redshift z s — 2, n ga i = 15 and 
1 arcmin smoothing. The maps have an angular size 
3.46 x 3.46 deg 2 , 2048 x 2048 pixels. To avoid edge ef- 
fects, 30 pixels(« 3 times the smoothing scale) from all 
four edges of each map are discarded, leaving an area of 
3.36 x 3.36 deg 2 . The fiducial model is used for studying 
the origins of the peaks, and a comparison between the 
fiducial and the low-as model is used for studying the 
cosmology-sensitivity. 

We first perform an analysis of peak-halo matching, 
closely following Hamana et al. 27J, for both high and 
medium peaks. High peaks are defined as those with 
v > 4.8, where v is the maximum value K poa k of the peak, 
in units of the standard deviation of the noise field. We 
restrict our halo catalog to halos that are expected (based 
on the NFW approximation; Eqs. ([5])-([6]) with O = 0) to 
produce a peak with a height of v n { w > 4.8. Similarly, for 
medium peaks with 1.1 < v < 1.6, we restrict our halo 
catalog to those halos with 1.1 < ^ n f w < 1.6. We carry 
out the peak-halo matching by searching for a matched 
pair candidate within a radius of 1.8 arcmin from the 
peak position or from the halo center. This maximum 
angular separation is chosen, as in [27], so that it is larger 
than the smoothing radius of 1 arcmin, while it is still 
smaller than the angular virial radius of a massive halo at 
z <1.3, where the lensing kernel has the most weight. If 
there is more than one candidate pair within this radius, 
we follow [37] and adopt the closest one as the primary 



class 


matching result 


number of matches 






(high peaks) (medium peaks) 


i 


halo peak 


526 (0.93) 


2653 (4.7) 


ii 


halo with no paired peak 


230 (0.41) 


19609 (35) 


Hi 


peak with no paired halo 


2264 (4.0) 


24709 (44) 


iv 


halo peak 


2 (0.0035) 


90 (0.16) 


V 


halo <= peak 


12 (0.021) 


194 (0.34) 



TABLE II: Matching of halos and peaks similar to Hamana 
et al. 127( , but shown separately for high and medium peaks. 
In the second column, "<4> " indicates a primary match in both 
directions, whereas =>■ and <= indicate a primary match in 
the direction of the arrow only. In total, our 50 realizations 
of 3.36 x 3.36deg 2 maps contain 2,802 high peaks and 27,556 
medium peaks, whereas the three-dimensional N-body outputs 
contain 758 massive halos and 22,352 medium halos with cor- 
responding masses. The numbers in () show the number per 
deg 2 averaged over 50 realizations, to be compared with the 
results of Hamana et al. \27j (see text for discussion). 



candidate. 

The 50 noisy maps contain a total of 2,802 high peaks 
and 27,556 medium peaks. For comparison, the halo 
catalogs contain 758 massive halos with v a f w > 4.8 and 
22,352 medium-sized halos with 1.1 < v n f w < 1.6. Fol- 
lowing Hamana et al. [27], we sort the results of the 
matches into the following five categories: (i) both a halo 
and a peak are each other's primary pair candidate; (ii) a 
halo without a paired peak (Hi) a peak without a paired 
halo; (iv) a halo has a matched peak, but is not the pri- 
mary matched halo of that peak; and (v) a peak has a 
matched halo, but it is not the primary matched peak 
of that halo. Table |TT] shows the number of matches, 
separately for high and medium peaks, falling into each 
category. 

Bearing in mind differences in redshift, noise, and peak 
height thresholds, our results are in reasonable agreement 
with [27] ■ Overall, we have found, on average, 5.0 deg -2 
high peaks and 1.3 deg -2 massive halos. The majority 
(70%) of the massive halos produce a one-to-one match- 
ing peak, although these account only for 20% of the 
total population of high peaks. More specifically, [27] 
finds 23 deg -2 high peaks and 8.1 deg -2 massive halos. 
Their matches per deg 2 in categories i, ii, and Hi are 
5.9, 2.1, 15 accordingly. In general, we have found fewer 
peaks and halos than [27] , which can be attributed to our 
lower erg and f2 m values. Our threshold of high peaks is 
also not identical to theirs. However, we have checked 
that the number of halos above a threshold is in good 
agreement with their equation 15 [27], when our cosmo- 
logical parameters are used. We have found a slightly 
lower completeness of the halos (less fraction of halos in 
category i and more fraction in category ii), and also a 
~ 50% lower purity (more fraction of peaks in category 
Hi). We have chosen a higher source redshift, z s = 2 
( [2"7] used z s = 1), this makes projection effects more im- 
portant, and may explain why our completeness is lower. 
Our noise is slightly larger, and our simulations and final 
maps have higher resolution, compared to [37J - these 



effects tend to increase the number of peaks relative to 
number of halos, and to reduce the purity of identifying 
halos. 

The most important result in Table [II] is that, in differ- 
ence from massive halos, only a small fraction (12%) of 
the 22,352 medium-sized halos produce a medium peak 
with a one-to-one pair, accounting for less than 10% of 
all medium peaks. We conclude that the close agreement 
in the number of medium peaks and halos (27,556 vs. 
22,352) is a coincidence - and noise and projection ef- 
fects are much more important for medium peaks than 
for high peaks. 

We next extend the above analysis, by identifying all 
halos, down to low masses, by using a larger, 3 arcmin 
cone around the LOS toward each peak, and by comput- 
ing the expected contribution of each halo Ki to the total 
convergence at the position of the peak (based on the 
NFW approximation). We then rank the halos accord- 
ing to their m values (starting from highest and going 
down to the lowest). We add the noise K no i se at the peak 
location to this ranked list, and ask the following ques- 
tion: starting from the highest value, how far down this 
ranked list do we need to sum the contributions, before 
they account for > 50% of the total peak height? 

In the upper two panels of Fig. [5j we plot the dis- 
tribution of this quantity; a "0" indicates that noise is 
the single largest contributor, and already accounts by 
itself for most of the peak, "1" indicates that at least 1 
halo had to be included, etc. These panels clearly show 
that the large majority of high peaks are dominated by 
a single halo, which accounts for at least half of the peak 
amplitude. Most of these halos fall below the expected 
threshold v n f w = 4.8. The high peaks thus have a much 
better one-to-one match with halos than Table |H] implies , 
once lower-mass halos are included. On the other hand, 
the large majority of medium peaks are dominated by 
noise. 

Since noise does not contain any cosmological infor- 
mation, in the bottom two panels of Fig. [5] we repeat 
the same exercise, except that the noise contribution is 
excluded, and we show the number of halos required 
to account for the total halo contribution. These pan- 
els show that while the high peaks are typically dom- 
inated by a single halo, the contributions from a sec- 
ond (or higher-rank) halos is often ( >50% of cases) as 
important. In the case of medium peaks, however, it 
is very rare (< 5% of cases) for a single halo to domi- 
nate the cumulative halo contribution. Instead, there is 
a broad distribution, but typically (in ~half the cases) 
4 — 8 halos are required to account for >half of this total 
halo contribution. As a sanity check, we have computed 
the analogous distribution for random directions on our 
maps (i.e., not toward peaks). For these random direc- 
tions, as shown in Figure [6] the distribution has an even 
broader shape, centered at 8, and generally shifted to- 
ward larger numbers of halos. This reassures us that the 
medium peaks still do preferentially pick out directions 
toward conjunctions of ~ 4 — 8 halos. We also exam- 
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FIG. 5: These figures illustrate the number of halos, as well as noise, contributing to medium and high peaks in 50 realizations. 
For each peak, we identify all halos along the sightline, and rank them (as well as noise) according to their contribution to the 
peak convergence. In the upper two panels, we show the distribution in the number of halos required to account for > 50% of 
the total peak height (with "0" corresponding to cases in which noise alone explains half of the peak value). In the lower two 
panels, noise is excluded, and we show the distribution in the number of halos required to account for > 50% of the total halo 
contribution. Error bars are estimated as the the standard deviation of the number of counts in each bin, multiplied by 



ined the masses and redshifts of these dominant ^4 — 8 
halos. We have found that the masses range between 
few x 10 12 M Q < M < few x 10 13 M Q , and found no cor- 
relations in redshift (i.e., the contributing halos are not 
part of a single structure). Finally, we find that simply 
adding up the expected re contribution of all halos along 
the LOS to a peak always overproduces the re value of 
the peak (not surprising, since this neglects the re deficit 
from underdense regions). 

We next wish to clarify why there are fewer medium- 
height peaks when a% is increased (and vice versa). One 
can intuitively guess that increasing a% simply increases 
the "scatter" due to large scale structures. In the linear 
regime, changing erg simply changes the local (3d) density 
contrast, by the same factor everywhere. Pretending that 
this holds in the nonlinear regime, it is easy to see that 
the set of peaks would be invariant under changing a% 
- however, positive peaks would be enhanced, and nega- 
tive peaks (i.e., maxima residing inside large-scale voids) 
would become yet more negative. This would broaden 
the peak-height distribution, and reduce the number of 
peaks near re poa k ~ 0. Of course, this picture is over- 
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FIG. 6: As in the lower panels in Figure^ we show the distri- 
bution in the number of halos required to account for > 50% 
of the total halo contribution. Here we contrast these distribu- 
tions for medium-height peaks and for randomly chosen direc- 
tions on the sky. The figure demonstrates that medium-height 
peaks preferentially pick out directions toward conjunctions of 
~ 4 — 8 halos. 
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simplified: as erg is increased, the change in the density 
field is not a simple re-scaling; furthermore, peaks can be 
destroyed and new peaks can be created. Indeed, we al- 
ready know that the total number of peaks changes with 
erg (Paper I). 

To understand the dominant effect, we attempt to fol- 
low and match individual peaks in a pair of models with 
different cr 8 . We have found that a direct matching of 
peaks is not possible, because the locations of the peaks 
tend to shift (by several to more than 20 pixels) and 
therefore the correspondence between peaks in two differ- 
ent maps remains ambiguous (except for the very highest 
and most conspicuous peaks). Instead, we proceed by us- 
ing halos as intermediate proxies for the peaks. Starting 
from a peak in the first cosmology, we identify the halo 
that contributes most to this peak. For this analysis, we 
consider only those peaks for which the matched primary 
halo contributes at least 10% of the total halo contribu- 
tion (otherwise it is unfair to use the halo as a proxy 
for the peak). We next search the entire halo catalog 
in the second cosmology, and identify the "same" halo, 
by finding the one that shares most common particles 
with the halo in the first cosmology. Finally, we search 
through the peaks in the second cosmology within a cone 
of 3 arcmin around the halo, requiring that their height is 
within the range ±<7 no i se of the original peak in the first 
cosmology, and that the "same" halo contributes at least 
10% of the total halo contribution. If no such peak is 
found, then the match is declared unsuccessful. If more 
than one such peak is found, we select the one to which 
the matched halo contributes the most. 

Fig. [7] shows the results of the above matching proce- 
dure between the fiducial and the low-erg models, for both 
high peaks (upper two panels) and medium peaks (lower 
two panels). Focusing on the high peaks first, there 
are 1,987 high peaks in the lower erg model, and 2,802 
high peaks in the fiducial model. In the upper-left panel, 
we start with the peaks in the low-er s model, and show 
their matches in the fiducial model. In the upper-right 
panel, we reverse the direction, and start with the fiducial 
model. We find an "unambiguous" match (in the sense 
defined in the preceding paragraph) for 87.1% and 80.8% 
of the peaks, respectively. Proceeding to the two lower 
panels, we show the results for the medium peaks. There 
are 29,097 medium-height peaks in the low-erg model; 
only 56.2% of these have a matching peak in the fiducial 
model. Likewise, starting from the 27,556 medium-height 
peaks in the fiducial model (bottom panel), we find that 
55.7% of these have a match in the low-as cosmology. 

In conclusion, most high peaks are matched to a peak 
in the other cosmology, although the peak n values in 
the two cosmologies differ by an amount comparable to 
"noise- In contrast, about half of the medium- height 
peaks do not have a clear match in the other cosmology. 
This is despite the fact that we use the same realiza- 
tion of the noise map in both cosmologies (i.e., we avoid 
creating an entirely different set of peaks by a different 
noise-realization), and despite our rather lenient defini- 



tion of a "match" . We speculatively interpret this result 
as follows. Since the medium peaks are typically created 
by the sum of pure noise and many halos in projection, 
their existence and their amplitude are both sensitive to 
small changes in the spatial distribution and masses of 
these halos. In indirect support of the above conclusion, 
we have identified the following trend: on average, the 
proxy-halo contributes 49.6% for high peaks that have 
a match, vs. 33.8% for those that do not. Similarly, 
for medium peaks, the proxy halo contributes 25.1% vs. 
18.9% for match vs. unmatched peaks. This shows that 
whenever the dominant halo accounts for a smaller frac- 
tion of the peak k, it is more "fragile" and is less likely 
to have a match in the other cosmology. Finally, we 
have found that when as is increased, then the matched 
halo in the higher-erg cosmology has typically grown more 
massive, with an increase by 15% and 12% on average 
for matched and unmatched high peaks, 16% and 14% 
for matched and unmatched medium peaks. This trend, 
however, does not hold in the k values of the matched 
peaks; as Fig.[7]shows, the peak heights have a significant 
scatter, but a relatively low bias, between the cosmolo- 
gies. This could be explained by the fact that underdense 
voids become even more underdense when as is increased, 
which tends to "cancel" the increase in K pea k caused by 
the fattening of halos. To be specific, we find a frac- 
tional bias ((k 2 — Ki) / Ki) = 0.034 and an r.m.s. scatter 
((k2 — Ki) 2 / nl) 1 / 2 = 0.19. when high peaks in lower er 8 
model are matched to high peaks in fiducial model. In 
the reverse direction, the bias and scatter are -0.088 and 
0.17. These results agree with the upper two panels in 
Fig. [7| showing a positive bias when as is increasing, and 
a negative bias when as is decreasing. The bias for the 
medium peaks is less clear than for the high peaks. In 
fact, the bias is positive in both matching directions, with 
the number for increasing as (0.068) more positive than 
the number for decreasing erg (0.032). This is because 
the medium peaks are dominated by noise, rather than 
halos, and the peaks which increase in height because of 
the positive noise have a larger chance to survive as a 
peak than those that are hurt by the noise. The scatter 
for medium peaks is « 0.32, about twice the scatter for 
high peaks. 

Finally, we examine the "movement" of the peaks in 
height k as erg is varied, in order to test our hypothe- 
sis, stated above, that an increase in erg tends to evac- 
uate peaks from near the k ~ (or near the maximum 
of the peak- height distribution). For example, we divide 
the 29,097 medium-height peaks in the low-erg model into 
several cases. Approximately half (12,738) of these peaks 
are unmatched: they "disappear" when erg is increased 
(equivalently, these are peaks that "appear" when one 
starts from the fiducial model, and decreases er 8 ). The 
remaining 16,359 matched peaks are further divided into 
middle, lower and high cases, based on whether they re- 
main in the original medium-K bin (7,683), exit this bin 
toward higher n (5,373) or to lower k (3,303), in the other 
cosmology. These results, as well as the corresponding re- 
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FIG. 7: The figure shows the results of an attempt to match individual high and medium peaks in a pair of cosmologies, as 
explained in the text. The x-axis shows the peak height in the starting cosmology model, and the y-axis shows the height of the 
"same" peak in the model to be matched with the starting model (whenever the peak has a match). The top-left panel shows 
matches found in the fiducial model for the high peaks in the low-as cosmology (and vice versa in the top-right panel); the 
bottom two panels repeat the exercise for medium peaks. Typically ~ 80% of the high peaks have a match, but only ~ 50% of 
the medium peaks do. 



class 


low-erg — > fiducial 


fiducial — Y low-erg 


exit to low k 


3303 


3420 


stay in bin 


7683 


7515 


exit to high n 


5373 


4408 


total matched 


16359 


15343 


lost (unmatched) 


12738 


12213 



TABLE III: We sort medium peaks into different categories, 
based on the outcome of our attempt to find a match for each 
peak in another cosmology. The total number of medium peaks 
in the low-as model and the fiducial model are 29,097 and 
27,556 respectively. While going from one cosmology to the 
other, peaks can be "lost", they can stay within the same k 
bin, or they can move out of the bin to higher or lower k 
values. Additionally, new medium peaks appear. 



suits in the reverse matching direction, are summarized 
in Tabic Hm 

By examining the table, we conclude there is indeed 
a preferentially larger scatter in the direction out of 
the "medium" bin, when going from the lower erg to 
the fiducial model, compared to the reverse direction: 



(3,303 + 5,373) > (3,420 + 4,408). This table further re- 
veals that there are two distinct reasons for the decrease 
in medium-height peaks. Approximately 2/3rd of the to- 
tal decrease (of 29,097-27,556 = 1,541 ps 1500 peaks), 
or fa 900 peaks, can be attributed to the above men- 
tioned "scatter" due to the increased density contrast - 
i.e. more peaks moving out of the bin than into the bin. 
The remaining ~l/3rd of the decrease is due to losing 
peaks, i.e. ps 500 more peaks are destroyed than created, 
as us is increased. Based on the preceding discussion, we 
speculate that this latter affect is caused by the projec- 
tions of multiple halos, which can create and destroy the 
relatively low amplitude peaks. 

In summary, the results in this section suggest that 
medium-height peaks are almost always dominated by 
pure galaxy shape noise, but they receive a significant 
contribution from collapsed halos, with typically 4-8 ha- 
los in projection along the LOS. The halos drive the cos- 
mological sensitivity of these peaks in two ways: by (i) 
changing the amplitudes of the noise peaks, and by (ii) 
destroying and creating new peaks. Between these last 
two effects, in the case of o~s, we found that the first is 
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^twice as important as the second. 



B. Comparison to Gaussian Predictions 

Our next task is to examine whether (i) the statistics 
of the peaks, and (ii) their cosmology-sensitivity differs 
significantly from predictions in a Gaussian random field. 
The degree of any departure from a GRF is especially 
important to quantify for the medium peaks, since the 
results of the last section suggest that these are heavily 
dominated by pure Gaussian noise. 

Our main results are shown in Fig. [HJ which directly 
compare the peak counts in our simulated maps with 
those in a GRF. The GRF predictions are computed from 
the theoretical formula as discussed above, but using the 
(moments of) the power spectrum <j , ci, &2 that were 
measured from the corresponding simulated maps. In 
each panel, we also compare the high-os, fiducial, and 
low-as models. In the bottom of each panel, we also show 
(i) the fractional difference between the GRF and the 
fiducial model and (ii) how the change in the peak counts 
between pairs of cosmologies is different in the GRF and 
our simulated maps. The source galaxies are assumed to 
be at z s — 2, and all results shown in the figure include 
1 arcmin smoothing. We plot the mean number of peaks 
in convergence bins of width Aft = \u no isc = 0.0045, av- 
eraged over 1000 realizations. In the top two panels, we 
exclude noise from the maps; in the bottom two panels, 
noise is included. Finally, in the right two panels, we have 
scaled the convergence field ft by its r.m.s. value a K (these 
histograms use a bin width of A(k/<t k ) — 0.25). This re- 
moves information that arises from a K alone. If the sole 
effect of changing erg was to change the heights of individ- 
ual peaks by a constant factor, then this would result in 
a re-scaling of the peak-height probability distribution; 
the re-scaling by a K clarifies the relative importance of 
this effect. 

As these figures show, in the noiseless case, the peak 
height distributions are very different from the Gaussian 
predictions and are reminiscent of the skewed one-point 
function of k, which has a sharp drop at low demagnifica- 
tion, and a long tail to high magnification (e.g. ref. [B] and 
references therein) . This correspondence of the high-tails 
is not entirely surprising; indeed, a pixel with a very high 
k value is likely to mark a peak. When noise is added, 
the distributions near their peaks look much more sim- 
ilar to the Gaussian predictions. However, there is still 
a large non-Gaussian deficit of the lowest peaks (with 
the most negative K pea k) aud a clear excess of the high- 
est (ft >3cr K ) peaks. Importantly, however, there also 
remains a clear difference in the peak-height distribu- 
tions even for the medium-height (ft ~ a K ) peaks. Fi- 
nally, as illustrated in the bottom insert in each panel, 
the cosmology-sensitivity of our peak histograms is also 
different from that in a GRF. These last points are en- 
couraging, and suggest that the medium peaks do contain 
non-Gaussian information. 



map 
type 


cosmology 
pair 


boundary 
locations 


noisy us 
noisy sc 


F and High-erg 
F and High-erg 


-0.0028 0.0217 0.0407 0.0695 
0.2650 0.6682 1.3550 3.3013 


noisy us 
noisy sc 


F and High-f2 m 
F and High-fi m 


-0.0050 0.0200 0.0383 0.0627 
0.4618 0.9950 1.5750 3.0556 


noisy us 
noisy sc 


F and High-iu 
F and High-iu 


-0.0019 0.0190 0.0347 0.0551 
0.2565 1.1450 2.4939 3.0368 



TABLE IV: Examples of bin boundaries used for the con- 
vergence peak counts. The boundaries are listed for unsealed 
("us") and scaled ("sc") noisy maps, used to compute Ax 2 be- 
tween the fiducial model and the high-ag, high-Q m , and high-w 
models, respectively. In the unsealed case, the boundary loca- 
tions are in units of the dimensionless convergence ft; in the 
scaled case, they are in units of v = k/<j k . 



map 


cosmology 


noiseless Ax 2 


noisy Ax 2 


type 


pair 


unsealed scaled unsealed scaled 


Sim 


F and High-erg 


5.16 


0.46 


5.89 


4.29 


GRF 


F and High-erg 


10.65 


0.23 


5.87 


3.16 


Sim 


F and Low-erg 


5.01 


0.34 


5.09 


3.67 


GRF 


F and Low-erg 


9.93 


0.16 


4.98 


2.58 


Sim 


F and High-f2 m 


3.61 


0.033 


4.02 


2.46 


GRF 


F and High-fi m 


7.68 


0.014 


3.77 


2.01 


Sim 


F and Low-f2 m 


4.39 


0.053 


4.44 


2.56 


GRF 


F and Low-$l m 


8.79 


0.043 


4.08 


2.11 


Sim 


F and High-w 


0.98 


0.47 


0.65 


0.27 


GRF 


F and High-w 


0.93 


0.017 


0.46 


0.14 


Sim 


F and Low-w 


0.44 


0.27 


0.36 


0.16 


GRF 


F and Low-to 


0.54 


0.004 


0.26 


0.08 



TABLE V: Ax 2 values from our simulated maps and from 
predictions in a GRF, based on the difference in the peak 
height distributions between the fiducial model and six other 
models, varying erg, w, and Q m . Results are shown for both 
the unsealed (N(k)) and the scaled (N(y)) peak distributions. 
Source galaxies are assumed to be at z s — 2, and a set of 
2 x 1000 maps are used in comparing each pair of cosmolo- 
gies. 



IV. DISCUSSION 

A. Sensitivity to Cosmological Parameters 

The number counts have been found (Paper I) to de- 
pend sensitively on a combination of (<7g,w;). Here we 
vary <t 8 and w separately, in order to clarify the sensi- 
tivity to each of these parameters; we also co nsid er vari- 
ations in Q m . We use Ax 2 , defined in Eq. (19 1 above, 



to measure the significance of the difference in the peak 
counts N(k), caused by the changes in these parameters. 
We used fiducial and other cosmological maps to calcu- 
late the change in N(k), but we used the control maps 
to compute the covariance matrix. Having 9 times more 
strictly independent realizations (45 control maps vs 5 
realizations in the fiducial model) allows us to compute 
the covariance matrix more accurately. To isolate the 
sensitivity from beyond a change in the r.m.s. cr K , we 
also compute the A% 2 's between the scaled peak height 
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FIG. 8: Number of peaks in our simulated 3.46 x 3. 46deg 2 (including the 3 arcmin edge) convergence maps, in bins of width 
Ak = 0.0045. In the right two panels, we have scaled the convergence field k by its r.m.s. value o~ K ; this removes information 
arising solely from a K . The source galaxies are assumed to be at z s = 2, and all results include 1 arcmin smoothing. In 
each panel, the three curves with data points correspond (from top to bottom on the right) to the high-a$, fiducial, and low-as 
models. The other three curves show theoretical predictions for peak counts in Gaussian random fields with the same three power 
spectra. In the bottom of each panel, we show the fractional difference between our fiducial model and a GRF for the peak counts 
{{N 3 i mu — Nqrf) /Ngimu) (demonstrating that the WL peaks are strongly non-Gaussian), as well as for the difference in the 
peak counts between pairs of cosmologies {(ANsimu — ANgrf)/ AN a imu) (explicitly demonstrating that the cosmology-sensitivity 
of our peak histograms is also different from that in a GRF). The four panels (upper-left, upper-right, lower-left, lower-right) 
show the results in noise-free unsealed, noise-free scaled, noisy unsealed and noisy scaled maps. 



distributions N(u). In these analyses, we use five conver- 
gence bins whose locations are chosen by visual inspec- 
tion, as explained above. Examples of bin boundaries we 
used are listed in Tabic Hvl 

Our main results are shown in Table [Vj and can be 
enumerated as follows. 

Raw cosmology sensitivity. The simulated noisy A\ 2 
values in the unsealed maps are significant (A% 2 <~ 4—6), 
and suggest that the cosmological sensitivity of the peak 
counts is competitive with other methods (after scal- 
ing to the full size of an all-sky survey, such as LSST; 
this extrapolation is discussed further below). The sen- 
sitivity for w is about an order of magnitude weaker 
(Ax 2 ~ 0.3 — 0.6) than for the other parameters. How- 
ever, this is the case for other observables, such as the 



power spectrum, as well. As shown below (see Table XIV 



and related discussion) the peak counts and the power 
spectrum individually have similar sensitivities to all 
three parameters; they can furthermore be combined to 
improve the marginalized errors by a factor of «two on 
all three parameters. 

Can we "scale out" the cosmological information? By 
comparing the scaled and unsealed cases in the noisy 
maps, we see that scaling the maps by the variance er K re- 
duces the A% 2 values only by a modest amount. In these 
maps, only a small fraction of the parameter-sensitivity 
arises through changes in o~ K . Interestingly, the situation 
is different in the raw, noiseless maps. Nearly all of the 
sensitivity in these maps are attributable a K : the A% 2 
values diminish significantly after the scaling. This re- 
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peak 
type 


cosmology 
pair 


K>bin 
boundaries 


noisy low 
noisy medium 
noisy high 


F and High-erg 
F and High-erg 
F and High-erg 


-0.2697 -0.031 -0.0250 
0.0150 0.0214 0.0390 0.0460 
0.1070 0.1240 1.4000 


noisy low 
noisy medium 
noisy high 


F and High-0, n 
F and High-f2 m 
F and High-f2 m 


-0.2697 -0.031 -0.0250 
0.0156 0.0313 0.0469 
0.1000 0.1150 1.4000 



TABLE VI: Bin boundaries used in the analysis to identify 
the relative importance of low, medium, and high peaks. The 
boundaries (including the end-points) are listed for the noisy 
A\ 2 between the fiducial model and the high-a% and high-Q, m 
models, in units of k. Two adjacent bins are used in the low 
and high ranges and either two or three bins in the medium 
range. 



suit is somewhat counter-intuitive, and implies that there 
is a "non-linear" interaction between noise and physical 
structures. More precisely, the result can be re-stated 
as follows: before adding noise, the cosmology-induced 
changes are very similar to a uniform 'stretching' of the 
peak height distribution along the x-axis. However, once 
the noise is added, the cosmology-induced changes are no 
longer described by such stretching. In hindsight, this is 
not entirely surprising: given that noise has almost no 
effect on the highest peaks, and has increasingly larger 
impact on the lower peaks, it is to be expected that the 
addition of noise spoils the "linear" stretching. 

The impact of noise. We find, furthermore, that the 
addition of the noise increases the Ax 2 values for as and 
for fi m , while for varying w, noise hurts. This is similar to 
a result we found in Paper I, namely that noise increases 
the change in the number of peaks. While in the unsealed 
maps, the increase in Ax 2 is modest, in the scaled maps, 
the increase is very significant (the interpretation of this 
is already explained in the preceding paragraph). This 
result - i.e. that the addition of pure noise helps increase 
the A% 2 - is also somewhat counter-intuitive, and will 
be discussed in detail in 



IV E below. 



Which peaks drive the sensitivity? To answer this ques- 
tion, we calculate A% 2 values separately for peaks with 
low, medium and high amplitudes. In the scaled maps, 
the low range is chosen to be below -0.8er K . The medium 
range is ±0.5er K wide, centered on the mode of the peak 
height distribution. The high range is defined to be above 
the height at which the peak-height distributions start to 
differ significantly in any given pair of cosmology. This 
is inferred visually from logarithmic scaled peak height 
distributions, such as those shown in the right two panels 
in Fig. [HJand typically falls at ~ 3.5er K , or k ~ 0.1. On 
the unsealed maps, we use the same boundaries as above, 
converted to k values using the fiducial model. Both in 
the low and high ranges, we adopt two (adjacent) bins, 
and in the medium range, we use either two or three bins 
(depending on where the peak-height distributions in a 
given pair of cosmologies cross). Table VI summarizes 
the bin boundaries. 

We show the resulting A% 2 values from each type of 



peak 
type 


cosmology 


noiseless Ax 2 noisy Ax 2 
unsealed scaled unsealed scaled 


low 
medium 
high 
frac. 


F and High-erg 
F and High-erg 
F and High-erg 
F and High-erg 


0.62 0.035 0.32 0.025 
2.55 0.024 3.30 0.87 
2.85 0.069 2.13 0.36 
1.17 0.28 0.98 0.29 


low 
medium 
high 
frac. 


F and Low-erg 
F and Low-erg 
F and Low-erg 
F and Low-erg 


0.34 0.05 0.21 0.02 
2.92 0.04 3.00 0.62 
1.73 0.09 1.27 0.32 
1.00 0.52 0.88 0.26 


low 
medium 
high 
frac. 


F and High-fi m 
F and High-S7 m 
F and High-f2 m 
F and High-fi m 


0.53 0.009 0.23 0.01 
1.53 0.004 2.65 0.70 
1.30 0.01 0.94 0.08 
0.93 0.68 0.95 0.33 


low 
medium 
high 
frac. 


F and Low-f2 m 
F and Low-f2 m 
F and Low-Q m 
F and Low-f2 m 


0.36 0.01 0.25 0.025 
2.16 0.007 3.00 0.70 
1.04 0.003 0.79 0.093 
0.81 0.36 0.91 0.32 



TABLE VII: Ax 2 values arising separately from peaks in the 
low, medium and high range, with bin boundaries as specified 
in Table {Vl\ The fiducial model is compared to models varying 
as and Q, m . The 4 th (last) row in each case shows the sum 
of the low, medium and high Ax 2 's divided by the total Ax 2 
obtained previously and listed in Table \V\ 



peak in Table |VTlj for models varying as and f2 m . As 
the difference caused by w is small, we do not discuss it 
here. We also list the ratio [Ax 2 (low) + Ax 2 (medium) + 
Ax 2 (high)] /A% 2 (tot), where the numerator refers to the 
values calculated here, and the denominator to the to- 
tal Ax 2 computed above from the entire k range. Even 
though the low/medium/high ranges we use are disjoint, 
this ratio can exceed unity (if in the original Ax 2 , the 
bins were non-ideally placed). The table shows that in 
the noisy maps, by far the largest contribution comes 
from peaks in the medium range. These are followed in 
importance by the high and the low peaks. We also see 
that in the noisy unsealed case, the low, medium and 
high ranges together account for essentially all (> 88%) 
of the total unsealed Ax 2 - In the scaled case, they add 
up to a smaller fraction (« 30 — 60%) of the total. 

How robust are the results? One may ask whether 
the Ax 2 values (e.g. listed in Table [v} are robust un- 
der changes of the random realizations of the underly- 
ing maps. This is a potential concern especially when 
the Ax 2 values are low. We used our control maps to 
re-compute both the covariance matrix, and the change 
in N(k), and to see how the Ax 2 values change. We 
found values of Ax 2 > 1 are very stable, and change 
by < 5%. For 0.1 < Ax 2 < 1.0, the change is - 20%, 
and for the smallest Ax 2 < 0.05, (occurring in scaled 
noiseless maps), the change is ~ 50%. Our finite num- 
ber of realizations is therefore only adequate to give an 
order-of-magnitude estimate of the distinction between 
these pairs of maps. We note that increasing the number 
of bins also changes the Ax 2 values (increasing them by 
~ 10%; see section |IV G|) , but this change is systematic, 



and docs not influence our conclusions. 
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Model 


noiseless Ax' 1 


noisy 


Fiducial 


164.20 


44.98 


High-cr 8 


191.22 


64.47 


Low-ct 8 


130.20 


32.23 


High-w; 


177.85 


47.91 


how-w 


157.33 


46.06 


High-n m 


180.53 


55.14 


Low-f2 m 


146.27 


36.15 



TABLE VIII: A\ 2 values derived from the peak height distri- 
butions between simulated maps and corresponding GRF pre- 
dictions. Source galaxies are assumed to be at z s = 2, and 
1000 noise free or noisy maps are used for each cosmological 
model. 



B. Distinction from a Gaussian Random Field 

We next turn to the question of whether the cosmol- 
ogy sensitivity offers information beyond a pure GRF. 
We know that the high peaks are non-Gaussian, whereas 
the medium peaks, which drive the sensitivity, appear 
to follow the GRF predictions more closely (though still 
visibly deviate from them, even in the noisy maps). 

We begin by directly quantifying the difference be- 
tween the GRF and the simulat ed pea k-height distribu- 



tions (shown in Fig. [8| . In Table VIII we show the A% 2 
values between the maps and the corresponding Gaussian 
predictions. In order to isolate the non-Gaussian effects 
in the mean peak counts, the covariance matrix is evalu- 
ated in the fiducial model (i.e. the covariance matrix in 
the GRF is not used). The large numbers in this table 
reveal that overall, the peaks are highly non-Gaussian, 
even in the noisy case. Similar to Table |VII| we study 
the significance of the non-Gaussianity separately for low, 
medium and high k peaks. We chose the ranges and the 
bin boundaries by the same procedure as described for 
Table [VlT] Our results are summarized in Table |TX] and 
quantify the expectation that both the medium and high 
peaks differ significantly from the GRF predictions, even 
in the noisy maps (although the noiseless maps are more 
non-Gaussian). In the noisy maps, the significance of 
the non-Gaussianity for the low peaks is relatively low, 
but this is likely a result of the relatively small number 
of these low peaks. The table also shows that the three 
disjoint regions together account only for about 20-30% 
of the total A^ 2 . This implies that the peak height dis- 
tribution departs from the GRF prediction everywhere 
(and no k values are unimportant for the total A% 2 ). 

The above demonstrates that one can (easily) tell the 
difference of each map from a GRF. We next ask whether 
the cosmology-induced changes also differ from those in 
the Gaussian case. To answer this, we first calculate the 
A% 2 between a pair of cosmologies, using the expecta- 
tion values of the GRF peak counts in both cosmologies, 
computed from Eq. ( 10 ), with ctq, 1,2 derived from the sim- 



peak 


cosmology 


Ay" 

A. 


Ay 2 

A 


type 


pair 


(noiseless) ( 


noisy) 


low 


Fiducial 


5.94 


1.01 


medium 


Fiducial 


9.56 


3.08 


high 


Fiducial 


17.94 


9.79 


frac. 


Fiducial 


0.20 


0.31 


low 


High-os 


7.43 


1.68 


medium 


Hie:h-<Ta 


6.76 


3.40 


high 


High-erg 


31.13 


18.55 


frac. 


High-ag 


0.24 


0.37 


low 


Low-ag 


3.25 


0.59 


medium 


Low-<xg 


13.41 


2.78 


high 


Low-ag 


10.10 


5.20 


frac. 


LoW-fTg 


0.21 


0.27 


low 


High-w 


5.78 


1.05 


medium 


High-w 


9.98 


2.72 


high 


High-'u; 


17.65 


9.98 


frac. 


High-'u; 


0.19 


0.29 


low 


Low-iu 


6.05 


1.12 


medium 


Low-iu 


6.91 


2.48 


high 


IjOw-w 


18.34 


9.73 


frac. 


IjOw-w 


0.20 


0.29 


low 


High-fi m 


6.87 


1.39 


medium 


High-fi,„ 


6.78 


3.45 


high 


High-O m 


26.45 


15.23 


frac. 


High-f2 m 


0.22 


0.36 


low 


Low-f2 m 


4.27 


0.82 


medium 


Low-f2 m 


14.03 


3.21 


high 


Low-f2 m 


11.36 


6.09 


frac. 


Low-f2 m 


0.20 


0.28 



ulated maps in the corresponding models. These results 
are listed below the cosmological A^ 2 values in Table |v| 
In order to isolate non-Gaussian effects in the mean peak 



TABLE IX: A\ 2 values of peak height distribution in low, 
medium and high range, between simulation and GRF theo- 
retical formula. Rate shows sum of low, medium and high 
A\ 2 divided by the total A\ 2 . Source is at z = 2. 



counts from those in the covariance matrix, in these GRF 
calculations, we again use the covariance matrix from the 
simulated fiducial model. Therefore, these GRF A\ 2 val- 
ues do not represent the absolute distinguishability of the 
two GRF maps; they are meant only to be compared to 
the Ax 2 's from the corresponding cosmological simula- 
tions. 

As the comparisons of two adjacent rows in the ta- 
ble shows, the noisy Ax 2, s in the simulations are gener- 
ally close, overall, to the corresponding values predicted 
in the GRF. This, of course, does not necessarily mean 
that the information is the same as in a GRF - indeed, 
we found above that the mean counts deviate from the 
GRF predictions even in the noisy case. However, the 
cosmology-induced differences in the peak counts are, ap- 
parently, similar in magnitude to that in a GRF. When 
we repeat the GRF calculations with the covariance ma- 
trix adopted from the mock GRF maps, we find that, 
typically, the A% 2 values increase by about a factor of 
~4. We find that in the GRF case, the standard devi- 
ation in the peak counts in each bin is close to Poisson 
shot noise ~ v^V- In our maps, the fluctuations are typ- 
ically larger, by up to a factor of ~4, which explains the 
corresponding reduction in the A% 2 . Interestingly, the 
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variance in the 3D space density of ~ 10 14 M Q clusters 
has been found to exceed Poisson noise by a similar fac- 
tor [23], providing physical intuition for this result. 

Finally, comparing the simulated and GRF-predicted 
values in the noiseless case, we see that non-Gaussianity 
reduces the significance between cosmologies before scal- 
ing, but increases the significance after scaling. This re- 
sult makes sense: the peak counts in the Gaussian case 
follow an almost strict linear scaling with a K , hence much 
of the difference disappears after such scaling (although 
the linear scaling is not, in fact, exact, see Eqs. ( 10 l-dTT])). 



C. Information Beyond the Power Spectrum 

In the last section, we found that the peak height dis- 
tribution is very different from the expectation in a GRF 
with the same power spectrum, especially for high peaks. 
While this is encouraging, we next study directly how 
much information is beyond the power spectrum. This 
is important, since in general, a random field can be 
non-Gaussian, but could still be fully characterized by 
its power spectrum. For example, one can imagine that 
the 3-point (and higher order) correlation functions are 
pre-specified functions of the power spectrum. 

We here measure the 2d convergence power spectrum 
£(£ + 1)P(£) directly from the map produced in each re- 
alization, and treat it as another observable, in addition 
to the peak height distribution. The factor £(£ + 1) 
is included to make the observables in each bin close 
in magnitude (which helps make matrix inversion more 
stable). To be more specific, we first computed the 
power spectrum in 200 equal-sized finer bins, with width 
At = 531. The power was evaluated by taking the 
Fourier transform of the convergence field and averag- 
ing the power in each finer bin in the radial direction. 
To calculate the Ax 2 's, we considered only the range 
100 < £ < 20, 000, and divided this range into 5 equal- 
sized bins (linear in £) , and assigned each of the 200 finer 
bins into one of these 5 bins (for computing marginalized 
errors, we use 15, rather than 5 bins; see below). Us- 
ing th ese five equally- spaced bins, we evaluate the mean 
Pi = £i(£i + l)Pi(£i) within each bin i, as well as the 
elements CV, = ((Pi — Pi)(Pj — Pj)) of the new 5 x 5 co- 
variance matrix. Ax 2 is then computed between pairs of 
cosmological models, analogous to Eq. ( 19 1 for the peak 
counts. 

In order to find the information beyond the power spec- 
trum, we first calculate the A\ 2 using the power spec- 
trum alone. We then combine the five peak counts and 
the five power spectra into a vector of 10 observables, and 
compute the cross-terms, dj = ((Pi — Pi)(Nj — Nj)), 
to obtain the elements in the off-diagonal blocks of the 
10 x 10 covariance matrix. This allows us to calculate 
A\ 2 from the N (k) + P(£) combination, taking into ac- 
count their correlations. 

The results are shown in Table |Xj Comparing the in- 
dividual A% 2 's first, we see that in the noisy maps, the 



observable type 


cosmology 


noiseless Ax 2 


noisy Ax 2 


Peak Counts (Axn) 


Fiducial 


5.16 


5.89 


Power Spectrum (Axp) 
Combination (Axnp) 
A X 2 NP /(A X 2 N + Axl) 


and 


17.06 


8.12 


High-as 


37.07 


16.36 




1.67 


1.17 


Peak Counts (A^at) 


Fiducial 


5.01 


5.09 


Power Spectrum (Axp) 
Combination (Axnp) 
A x %p/(Ax 2 n + A X p) 


and 


13.03 


5.76 


Low-0"8 


26.79 


11.87 




1.49 


1.09 


Peak Counts (Axn) 


Fiducial 


3.61 


4.02 


Power Spectrum (Axp) 
Combination (Ax%p) 
A X 2 N p/(A X 2 N + Axl) 


and 


17.69 


6.15 


High-fi m 


32.42 


11.65 




1.52 


1.15 


Peak Counts (Axn) 


Fiducial 


4.39 


4.44 


Power Spectrum (Axp) 
Combination (Ax%p) 
A X 2 NP /(A X 2 N + A X 2 P ) 


and 


16.49 


5.61 


Low-£7 m 


29.47 


10.94 




1.41 


1.09 


Peak Counts (Ax^) 


Fiducial 


0.98 


0.65 


Power Spectrum (Axp) 
Combination (Axnp) 
A x 2 np/(Ax 2 n + A x 2 p) 


and 


0.92 


0.29 


High-ui 


2.79 


0.84 




1.46 


0.90 


Peak Counts (Axn) 


Fiducial 


0.44 


0.36 


Power Spectrum (Axp) 
Combination (Axnp) 
A x 2 N p/(A x 2 N + A X %) 


and 


0.64 


0.19 


Low-uj 


1.69 


0.51 




1.57 


0.92 



TABLE X: Ax 2 from peak counts, power spectra, and their 
combination, computed between the fiducial model and six 
other models varying as, w, and Q m independently. Ax%p 
denotes the Ax 2 from the combination of peak counts and 
power spectrum, including their correlations; Ax% and Ax% 
denote the individual Ax 2 's. 1,000 noise-free or noisy maps 
are used for each of the model. Source galaxies are at z s — 2. 



sensitivity of peak counts is roughly comparable to the 
power spectrum, although about ~ 50% weaker for ct s 
and f2 m , and about twice stronger for w. In the noiseless 
maps, the power spectrum is more sensitive, especially 
for erg and f2 m . This shows, interestingly, that the power 
spectrum sensitivity is much more degraded by noise than 
the peak counts. This is not surprising, given that the 
constraints from the power spectrum are dominated by 
linear fluctuations on relatively large scales [29j [30] , with 
noise adding linearly to the large-scale structure signal. 
In other words, unlike for peak counts, adding noise does 
not change the signal (i.e., the difference APi between 
two cosmologies), as long as the galaxy noise is inde- 
pendent of cosmology, whereas the noise increases the 
variances ((Pi ~ P7)(Pj ~ Pj))- 

Inspecting next the combined Ax 2 's (shown in the 
third row in each section of Table [X]) , we find that these 
are comparable to adding the two individual A% 2 val- 
ues. This would be expected if there were no cross- 
correlations between power spectra and peak counts. In- 
deed, this result appears consistent with the negligible 
correlation between the 3d space density of clusters and 
the 2d convergence power spectrum [29l [31]. Interest- 
ingly, however, in the noiseless maps, the N(k) + P(£) 
combination yields a better sensitivity than adding two 
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source 


cosmology 


noiseless Ax z 


noisy Ax 2 






unsealed scaled unsealed scaled 


z2 


Fiducial 


5.16 


0.46 


5.89 


4.29 


zl 


and 


3.36 


0.66 


2.67 


2.56 


zl2 


High-ag 


5.99 


0.91 


6.16 


4.84 


zl2/(z2+zl) 




0.70 


0.81 


0.72 


0.71 


z2 


Fiducial 


5.01 


0.34 


5.09 


3.67 


zl 


and 


3.27 


0.73 


2.23 


2.23 


zl2 


Low-a"8 


5.90 


0.94 


5.29 


4.05 


zl2/(z2+zl) 




0.71 


0.88 


0.72 


0.69 


z2 


Fiducial 


3.61 


0.033 


4.02 


2.46 


zl 


and 


4.47 


0.044 


2.97 


2.15 


zl2 


High-f2 m 


5.41 


0.067 


4.51 


3.12 


zl2/(z2+zl) 




0.67 


0.87 


0.65 


0.68 


z2 


Fiducial 


4.39 


0.053 


4.44 


2.56 


zl 


and 


5.30 


0.051 


2.86 


2.23 


zl2 


Low-f2 m 


6.51 


0.082 


4.76 


3.15 


zl2/(z2+zl) 




0.67 


0.79 


0.65 


0.66 


z2 


Fiducial 


0.98 


0.47 


0.65 


0.27 


zl 


and 


1.24 


0.58 


0.40 


0.20 


zl2 


High-ui 


1.57 


0.83 


0.70 


0.37 


zl2/(z2+zl) 




0.71 


0.80 


0.67 


0.79 


z2 


Fiducial 


0.44 


0.27 


0.36 


0.16 


zl 


and 


0.94 


0.39 


0.37 


0.19 


zl2 


Low-w 


1.12 


0.56 


0.48 


0.28 


zl2/(z2+zl) 




0.81 


0.85 


0.66 


0.80 



TABLE XI: This table examines a simple case of tomography 
with two redshifts. A\ 2 values are shown between the fiducial 
model and six other models varying og, w, and Q m , for both 
unsealed and scaled peak height distributions, obtained using 
1,000 noise-free or noisy maps. Source galaxies are located at 
z s = 1, at z s = 2, or at both redshifts (denoted by zl, z2, and 
zl2). The rows labeled by "zl2/(z2+zl)" show the combined 
A\ 2 divided by the sum of the individual A\ 2 of zl and z2. 



observables independently (by ~ 50%; see each 4th row 
in the Table). This "sum greater than its parts" ef- 
fect can arise whenever N(k) and P(£) have a nonzero 
correlation (ANAP) ^ 0, and the cosmology-induced 
changes SN and SP do not obey the same correlation. 
It can be verified, after some algebra, that in the case 
of two observables N and P, individually yielding 
and A%p, the condition for Axj^ P > + ^Xp is 

(SNSP)/(ANAP) < (A X 2 N + Axp)/2. By inspecting 
the cross-terms in our 10 x 10 covariance matrix, we have 
verified that this condition is satisfied for each JV< and Pj 
pair whose combination enhances their A% 2 . For exam- 
ple, when cr 8 is decreased, the peak counts in the lowest 
bin (Ni) decrease, as do the power spectra - however, the 
covariance matrix predicts an anti-correlation between 
Nx and all five Pj's. 



D. Redshift Tomography 

Our analysis above relied on a single source galaxy 
redshift at z s = 2. In a realistic survey, there will of 
course be a distribution of galaxy redshifts. Using galax- 
ies at different redshifts ("tomography") could, in princi- 



ple, strengthen cosmological constraints significantly, de- 
spite the strong correlations in the signal measured at 
different source galaxy planes [30l |32] • 

Here we evaluate the benefits of tomography in the 
simplest case of having source galaxies at two distinct 
redshifts. We calculate the A\ 2 from the peak counts, 
as before, from source galaxies (15 arcmin" 2 ) separately 
at z 8 = 1 and z s = 2, using five convergence bins at each 
redshift. We then combine these, and calculate the A^ 2 
using both redshifts (i.e. a total of 30 galaxies arcmin -2 ) 
and their joint 10x10 covariance matrix. This calculation 
includes the covariance across the two redshift bins, and 
is analogous to the combination of the peak counts and 
the power spectra described in the previous section. 

The results are shown in Table IXIl In each section of 
the table, the first three rows show A\ 2 at z s = 2, z s = 1, 
and the combined constraints. The fourth row shows the 
ratio of the combined A% 2 to the sum of the individual 
A% 2 's at the two redshifts. This last quantity checks the 
importance of the covariance between the two redshifts. 
It would be unity if the two PDFs were completely inde- 
pendent, but can be either larger or smaller than unity if 
the correlations between the two redshifts are important 
(as discussed in previous section). 

Comparing the individual redshifts first, as the first 
two rows in the table show, in the noisy maps, z s = 2 
generally yields a better sensitivity than z s = 1. The 
only exception is the low-w case, when the sensitivities 
at the two redshifts are comparable (with z s = 1 only 
slightly better). This is consistent with our results in 
Paper I, in which we have also found that the sensitivity 
to a combination of (a^.w) increases with source galaxy 
redshift. The advantage of higher redshift is explained by 
the accumulation of a larger overall lensing signal, when 
going to a large distance. In our noiseless fiducial maps, 
we have a K — 0.022 and 0.013 at z s = 2 and z s = 1, 
respectively, which is to be compared to our assumed 
noise of cr n oisc = 0.023 and er n oisc = 0.019 at z s = 2 and 
z 8 = 1. Comparing the noisy and the noiseless results in 
Table |XI[ we see that adding noise to the unsealed maps 
for z s = 1 always hurts, and decreases the A% 2 values. 
In contrast, adding noise increases the A\ 2 values for 
z s = 2. We find that at both redshifts, adding noise 
enhances the difference in the total number of peaks (this 
counterintuitive result is explained in detail in the next 
section) . However the A\ 2 depends not only on the total 
number of peaks, but also on the shape of the peak height 
distribution. At z s = 1, where a K is well below the noise 
"noise, the peak count shape distribution is much more 
vulnerable to the noise. 

It is worth noting that, apart from the importance of 
noise, there are trends with redshifts arising from the 
cosmological dependence of (i) geometrical distance fac- 
tors in the lensing kernel, and (ii) from the growth of 
the matter perturbations. Individually, both of these 
depend on cosmology, with the induced differences in- 
creasing with redshift, and strengthening the sensitivity. 
However, there are cancellations when the effects from 
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the geometry and growth work in the opposite direction, 
which weakens the overall sensitivity. This cancellation 
can be worse at high redshift. This explains why in the 
noiseless case, z s = 1 is, in fact, better than z s = 2, for 
both w and tt m . Reducing Q m or increasing w both re- 
sult in flatter growth (i.e. larger density fluctuations at 
high redshift, for fixed as), which is canceled by a re- 
duction in the lensing kernel. For <7 8 , when only growth 
effects are present, and there are no such cancellations, 
the sensitivity always increases with redshift. 

Inspecting next the combined Ax 2 's, we see that in 
general, tomography does not significantly improve the 
sensitivity, compared to having only the more sensitive 
of the two redshifts. This is partly due to the fact 
that the less sensitive of the two redshifts is significantly 
less sensitive, and partly due to the covariance between 
the two redshifts, which reduces the combined A% 2 by 
^30% compared to having two uncorrelated measure- 
ments. (The typical value of the "covariance parame- 
ter" shown in the fourth row in each section of Table IXII 
is ~0.7.). We emphasize, however, that the change in 
the peak counts induced by each parameter has, in gen- 
eral, a different redshift-dependence. Therefore, tomog- 
raphy can still be very useful to improve the marginal- 
ized constraints, whenever there is a strong degeneracy 
between parameters at a single redshift (see discussion of 
marginalized constraints below). 



E. Why Does Noise Increase the Signal-to-Noise? 

An interesting finding in this paper is that adding ran- 
dom noise can sometimes boost the cosmology sensitivity 
of peak counts (i.e., at high redshift, as mentioned in last 
section). A similar result - namely that the difference 
in the total number of peaks is increased when noise is 
added - was found (but not explored) in Paper I. This is 
a counter-intuitive result that we investigate here. 

For simplicity, the discussion below will be restricted to 
the Ax 2 obtained from the total number of peaks (effec- 
tively using a single convergence bin) . We find that noise 
boosts these A^ 2 , as well. For example, the mean num- 
ber of peaks in our fiducial, high-ag and high-f2 m models 
are 2337.9, 2326.4 and 2339.3 in noise free maps, and 
3414.6, 3362.2 and 3369.7 in noisy maps, respectively. 
The r.m.s of the total number of peaks in the fiducial 
model is 35 in the noise-free maps and is only slightly 
larger, 38, in the noisy maps. This implies that, for ex- 
ample, for erg, noise increases A\ 2 from 11.5/35 ps 0.3 to 
52.4/38 « 1.4. 

Below, we will use the predictions in a GRF to explain 
such an increase. The advantage of using a GRF is that 
the peak counts are analytically predictable, allowing us 
to understand the effect of the noise exactly. Also, as 
we showed earlier, the cosmology-induced differences in 
the peak counts are generally close to those in a GRF 
(even though the peak height distributions are dissim- 
ilar). Therefore, it is reasonable to use the GRF as a 



guide to understand a boost in the Ax 2 - 

The galaxy shape noise added to our maps is assumed 
to be uncorrelated in each pixel - this corresponds to 
a GRF with a flat power spectrum, or "white noise". 



Applying the definition of o~ p in Eq. ( 14 ) to such white 
noise, and assuming a smoothing scali 
following relations between er n0 , 



e V G 



we find the 



'nl, Cn2- 



' n2 



2 


4 


nl 


W 




G 


2 


8 


nO 


M, 







(24) 
(25) 



where a n o 



is given by Eq. 

-2" 



{2). In our case, with 
7ig a i = 15 arcmin - ' 4 , and 1 arcmin smoothing at 
z s = 2, we find the numerical values (c 2 , cr 2 l5 a 2 2 ) = 
(0.00051354,3.6975 deg" 2 , 53244 deg- 4 ). 

For arbitrary power spectra, the total nu mbe r of peaks 
is given by a constant x(cr|/(T 2 ) (see Eq. (15)). In the 
following analysis, we drop this constant for convenience. 
Let us next denote the cr's in the first cosmology by Co, 
1 , 02, and in the second cosmology by er , a[, a 2 . In the 
absence of noise, the difference in total number of peaks 
is given by 



A; 



■pk 



Ki 



(26) 
(27) 



where r\ = a' 2 ja\ and r 2 = cr 2 /a 2 . Since the noise is as- 
sumed to be uncorrelated with the noise-free convergence 
field, the cr's of the of the noise-free field and of the noise 
field add linearly. Therefore, the difference in the total 
number of peaks, after the noise is added, is given by: 



pk, noise 



crj (1 + a 2 ) _ of (r 2 + Qg) 
a\ (1 + 01) a\ (n + 01) 
1 r (l + a 2 ) _ (r 2 + a 2 ) 
.(l + oi) (n + 01) 



(28) 
(29) 
(30) 



where a\ = 0^1/ °~i an d °2 = cr n2/ <7 I- 

We now look at the magnitudes of cti and a 2 (which ex- 
press the importance of noise relative to the cosmological 
lensing signal) and r\ and r 2 (which express the changes 
caused by the cosmology). Considering, as an example, 
the fiducial model and the high-cr 8 model as the second 
(primed) cosmology, we have a\ = 3.368, a 2 — 5.537, 
r\ = 1.197 and r 2 = 1.189. Clearly, r\ and r 2 are very 
close to each other, whereas a 2 differs significantly from 
a\. Looking at Eqs. (28) and (26), we see that r 2 /ri w 1 



implies An p k will be small, and comparing the factors 
multiplying the term o-\ja\, we infer An p k, noise > An p k 
as long as T\ ~ r 2 > 1 and a 2 > at > 1. In our case, 
we find An p k jn oise = 54.2 and An p k = 15.3. This then 
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implies a significant increase in the total A% 2 , provided 
that the r.m.s of total number of peaks doesn't increase 
much (which is indeed the case; we find that the r.m.s. in- 
creases by « 8%). The first of the two conditions respon- 
sible for Aripk, noise > An p k, namely that t\ sa ri, says 
that the first and second derivatives of the correlation 



function (Eq. 14 1 scale very similarly with our parame- 
ter, cr 8 . This makes sense, and would indeed hold strictly 
(with r\ — r<i) in the linear regime. However, although 
we are using GRFs, we adopt the nonlinear power spectra 
from the simulations, and therefore the scaling with a% 
is stronger than linear on small scales. We have verified 
that the small (< 1%) difference we find between r\ and 
r2 is not a numerical artifact, and a similar difference is 
present when we compute the u's from the theoretical 
power spectra [TO]. The second of the two conditions is 
that et2 differs significantly from a\, with a 2 > a\ (note 
that since noise is added on top of the maps, (02, <Xx) > 1 
always holds), but not overwhelmingly > 1 (otherwise 
noise would dominate the lensing signal, and there would 
be no distinction). This also makes sense: the relative im- 
portance of the noise and the cosmological lensing signal 
is wavelength-dependent. More specifically, the latter de- 
creases with increasing wavenumber, and therefore noise 
is increasingly important on small scales - as a result, 
whenever the noise is significant, it has a bigger effect on 
the second derivatives than on the first. 



The above result raises two more questions. First, 
what is the ideal noise level, which maximizes the signal 
Aript (or, ultimately, the actual A% 2 values in the sim- 
ulated WL maps)? Also, how much noise is too much? 
As the noise is increased, eventually it must hurt, and 
reduce An p k below its noiseless value. At what level of 
noise does this occur? In order to answer these questions, 
we first repeated the analysis in the GRF case, but mul- 
tiplied the noise cr nu by a constant factor. This increases 
<7„i and <7„2 by the same factor, and ai and a 2 by the 
square of this factor, so the dependence of An p k on the 
noise level can be simply obtained from equation (30 1. 



In practice, we went through the exercise of adding ran- 
dom noise with different amplitudes to the mock GRF 
maps. We found that the difference in the peak counts 
(between the fiducial and the high-cg models) followed 
very accurately the predictions from equation ( p0"| ) . Hav- 
ing the maps then allowed us to compute the variance 
in the number of peaks (Sn^) (in our fiducial model). 
These results are shown in the 2nd and 3rd columns of 
Table |XTT[ We then performed the same exercise for the 
simulated WL maps (again between the fiducial and the 
high-08 models), adding different levels of noise, and re- 
computing Anpit (shown in the 4th column of Table XII I, 
as well as the A\ 2 both scaled and unsealed, as defined 
above (5th and 6th columns). 5 



Noise 


GRF 




Simulations 


Level 


Aripk 




Aripk 


Ax 2 


Ax 2 








(scaled) 


(unsealed) 





15.3 


35.0 


11.5 


7.20 


0.43 


0.5 


75.2 


42.4 


65.5 


8.41 


5.08 


1* 


54.2 


37.7 


52.4 


6.57 


4.23 


2 


20.1 


31.5 


21.1 


2.16 


1.00 


4 


5.6 


31.1 


5.8 


0.27 


0.10 



TABLE XII: The difference in the total number of peaks be- 
tween the fiducial model and the high-ag model, as a function 
of the level of the noise, in the GRF case (2nd column) and in 
the simulated WL maps (4-th column). The first column shows 
the numerical factor by which we multiplied the original noise 
level (the third row, marked with a star, corresponds to the 
original noise). The 3rd column shows the r.m.s. of the peak 
counts in the fiducial model and the 5th and 6th columns show 
A^ 2 values as a function of the noise. In all cases, we find 
that the two cosmologies are best distinguished when approx- 
imately half of our original noise is added to the maps; the 
distinction rapidly decreases for noise > twice our original 
value. 



Table |XTT| shows that there is an "ideal" noise level, 
at which An p k is maximized. This turns out to be ap- 
proximately half the noise we adopted. There is also a 
level (approximately twice larger than we adopted), be- 
yond which noise actually hurts in the absolute sense, 
i.e. An p k becomes smaller than in the noise-free case. 
As the noise is increased further, An p k tends to zero, as 
it should. Interestingly, these conclusions hold, both in 
the GRF and the simulated maps (2nd and 4th rows). 
We found that the variance in the peak counts does not 
change significantly as noise is added (either for a GRF 
or in our simulated maps; the GRF case is shown in the 
3rd column). Most importantly, the actual A^ 2 values 
are also maximized at ^half of our original noise (5th and 
6th columns); the unsealed A% 2 , however, drops quickly 
below the noiseless case when the noise exceeds the orig- 
inal value. 

The above analysis demonstrates that the naive intu- 
ition, namely that noise can only decrease the signal- 
to-noise ratio (which is manifestly true when the signal 
and noise add linearly) , no longer holds in our case. This 
naive intuition is known to fail when the " signal" is a non- 
linear function of the noise. Indeed, noise can amplify the 
signal non-linearly, via a phenomenon called "stochastic 
resonance" [33j . under three generic conditions: (i) the 
presence of some form of threshold in the definition of 
the signal, (ii) a weak coherent input, and (iii) a source 
of noise that adds to the coherent input. It is interesting 
to note that these three conditions are satisfied in our 



5 In this last analysis, we used fixed k bins with roughly equal 
counts. This was necessary to avoid choosing different bound- 



aries, for each noise level, by the ad-hoc opt imiz ation procedure 
used above. This causes Ax 2 values in Table |xiT| to differ slightly 
from those in Table [V] but should not affect our argument and 
conclusions here. Bin boundaries are discussed in detail in § |IV G| 
below. 
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Aiipk 


(fiducial)- (high-erg ) 


(fiducial)- (high-f2 m ) 




noisy 


noiseless 


noisy 


noiseless 


GRF 


54.2 


15.3 


43.6 


-1.7 


Sim 


52.4 


11.5 


44.9 


-1.4 



TABLE XIII: The change in the total number of peaks when 
the fiducial model is compared to the high-as or the high-Q m 
model. As the table shows, the changes are significantly en- 
hanced by noise, and are similar in the GRF and the simula- 
tions. 



WL peak-counts, and hence WL peaks appear to be an 
example of this phenomenon; this connection is worth 
exploring further in future work. 

Finally, when we compare the fiducial model to the 
high-f2 m cosmology, we find that noise boosts the signal 
even more significantly than for the high-<7 8 model. In 
this case, we have a\ = 3.368, ct2 = 5.537, r\ = 1.1707, 
and r-i = 1.1716. Clearly, both conditions above are still 
satisfied, and we obtain An p k, noise = 43.6 compared to 
A"-pk = —1.7. Note that in this case, An^ is negative, 
because r\ <T2- For reference, the total number of peaks 
in our fiducial, high-erg and high-f2 m models, calculated 
through Eq. pi, is 2275.0, 2259.7 and 2276.7 in noise 



free maps, and 3431.3, 3377.1 and 3387.7 in noisy maps. 
These differ from the total counts in the simulations, 
quoted in the beginning of this section, by An « —60 and 
An +20 in the noise-free and noisy cases, respectively. 
However, as already noted above, the cosmology-induced 
differences are very similar in the simulations and the 
GRF case, as summarized in Table IXIII 



ences in mean peak counts between models. An impor- 
tant question is whether these A\ 2 can be interpreted as 
likelihoods, or confidence levels on parameter estimates. 
If the observables (in our case, the peak counts in each 
bin) were Gaussian distributed, and if they depended lin- 
early on the the parameters (in our case, the cosmological 
parameters), then our A% 2 would follow true \ 2 distri- 
butions. When fitting a single parameter, as in our case 
above, A\ 2 — 1, 4, 9 would then correspond to the usual 
68.3%, 95.4%, 99.7% confidence levels. 6 

To see how good the above approximations are, in 
Fig. p| we first show the distribution of X/( r ) over dif- 
ferent realizations in the fiducial model itself, shown by 



the red crosses (computed from Eq. (21 1, using noiseless, 
unsealed maps, and z s — 2). We fit these data with a 
standard chi-squared distribution P % 2 (ax 2 , DOF), treat- 
ing a linear scaling constant a and the number of degrees 
of freedom DOF as free parameters. We find best-fit val- 
ues of a = 1.0015 and DOF = 5.23. Reassuringly, the fit, 
shown by the red solid curve, is very good, with a close 
to 1, and DOF close to 5, the number of bins we used. 

DOF = 5 is also shown 

2 / 



For comparison, P x 2 with a = 1 



as the blue dashed curve. Clearly, xj ( r ) closely follows a 
chi-squared distribution expected if the deviation of peak 
counts from the mean were Gaussian. 

These results justify interpreting our Ax 2 's in the Ta- 
bles above as (single-parameter) confidence levels. As 
seen in Table |Vj the unsealed, noisy maps when erg 
and TO are varied correspond to "2 — 2.5cr" differ- 
ences from the fiducial model; w variations correspond 
to "0.5 - 0.8cr" differences. 



F. Can Ax be Interpreted as a Likelihood? 



G. Impact of the Choice of Binning 
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FIG. 9: The PDF of X/( r ) over different realizations within 
our fiducial model (red crosses). The data is fit well by true 
chi-sguared distributions: the solid red and blue dashed curves 
show true x 2 -distributions with 5.23 and 5 degrees of free- 
dom. Noise-free, unsealed maps were used for this figure, with 
source galaxy redshift z s — 2. 

So far, we have quoted the A% 2 values based on differ- 



In all our results above, we have used a fixed number of 
(five) bins, and performed only an ad-hoc optimization 
of the bin boundaries by hand. It is important to ask 
how our results are affected both by the number of bins, 
and by the placement of the bin boundaries. Ideally, 
one could use arbitrarily fine binning, and avoid such 
questions; in practice, we arc limited by the finite number 
of realiza tion s we can simulate. 

In Fig. 10 we show the Ax 2 in noisy unsealed maps, 
with z s = 2, between the fiducial model and the low-os, 
low-it) and low-fi m models, as a function of the number 
of bins. We chose the bin boundaries either by following 
the approach of avoiding "crossings" of the peak-count 
PDFs in a pair of cosmologies within bins, or such that 
the mean number of peaks in each bin were the same. As 



Note that we neither have an actual data set, nor do we perform 
a x 2 minimization to find the best-fit parameters. We are thus 
effectively assuming that the mean peak counts in our fiducial 
model are the data, yielding our fiducial parameter as the best 
fit; we can then find the confidence limits corresponding to the 
other six models. 
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FIG. 10: A\ 2 from peak counts, as a function of the number 
of convergence bins. The fiducial model was compared to the 
low-as, low-Q, m , and low-w models, using noisy unsealed maps 
with z 3 = 2. Bin boundaries were chosen such that each bin 
contains equal counts, or such that "crossings" of the peak- 
count PDFs in a pair of cosmologies are avoided within bins 
(as labeled). The y-axis labels on the left refer to the erg and 
Q,m cases; the labels on the right to the w case. 



the figure shows, the "avoid crossings" approach works 
quite well, and from 5 to 20 bins, the A% 2 's increase only 
modestly (by « 10%). The "equal counts" approach does 
more poorly (yielding smaller A% 2 ) when the number of 
bins is small, but converges to a very similar values once 
the number of bins is >15. These results give reassurance 
that we have a sufficient number of bins and the Ax 2 's 
shown in the Tables above have converged to within ~ 
10%. 

In the next section, we vary multiple parameters si- 
multaneously. In Figure |11[ we show the marginalized 
errors of the three cosmological parameters <7g, f2 m , w 
from the combination of the peak counts and the power 
spectrum, as a function of the number of bins (in noisy 
unsealed maps, with z s = 2). When choosing the bound- 
aries, we applied the "equal counts" approach for the 
peaks and the "equally spaced" approach for the power 
spectrum. The details of computing the marginalized er- 
ror is explained in the next section. The figure shows 
that the five bins are not sufficient in this case; however, 
the marginalized error converges to within < 10% once 
the number of bins is >15. 



H. Forecasting Marginalized Errors 

In all previous calculations, we have varied a single pa- 
rameter, holding all the other parameters fixed. While 
this clarifies the raw cosmological sensitivity of the peak 
counts, justified if CMB (or other) observations can be 
used to determine the parameters with negligibly small 
errors, one has to simultaneously vary all uncertain pa- 
rameter, and consider their degeneracies, to obtain re- 
alistic error forecasts (even in the limiting case of no 
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FIG. 11: The marginalized errors of the three cosmological pa- 
rameters as, Q. m , w from the combination of the peak counts 
and power spectrum, as a function of the number of conver- 
gence bins. The fiducial model was compared to the high/low- 
os, high/low-Q m , and high/low-w models, using noisy un- 
sealed maps with z 3 = 2. Bin boundaries were chosen such 
that each bin contains equal counts for peak counts and equally 
spaced with the cut at £ = 20, 000 for the power spectrum. The 
y-axis labels on the left refer to the erg and J7 m cases; the labels 
on the right to the w case. 



systematic errors). While numerical limitations preclude 
us from exploring the full cosmological parameter space, 
we here use a Fisher matrix to obtain marginalized er- 
rors when the three parameters a^, w, and Q m are var- 
ied simultaneously. Degeneracies between these param- 
eters are among the most important for both cluster 
counts (e.g. ref. [51]) and for shear power spectra (e.g. 



marginalized error 


OS 


w 


£7 m 


z2 
zl 
z2+zl 

Power Spectrum (z s = 2) 


0.0065 
0.0078 
0.0024 
0.0047 


0.030 
0.036 
0.018 
0.026 


0.0057 
0.0057 
0.0022 
0.0028 


z2+Power Spectrum 
zl+Power Spectrum 
tomography combined 
combined/( z2+Power Spectrum) 


0.0026 
0.0037 
0.0012 
0.47 


0.012 
0.020 
0.0096 
0.79 


0.0019 
0.0026 
0.0010 
0.52 



TABLE XIV: Marginalized 68% errors, in our noisy maps, on 
the cosmological parameters os, w, and Sl m . In the top half of 
the table, peak counts and power spectra are considered sepa- 
rately. From top to bottom: (i) counts alone at z s = 2; (ii) 
counts alone at z s — 1; (Hi) counts alone with both z s = 1 and 
z s — 2; (iv) power spectrum alone at z s = 2. In the bottom 
half of the table, counts and the power spectrum are combined. 
From top to bottom: (v) combining counts and power spec- 
trum at z a — 2; (vi) combining counts and power spectrum 
at z s — 1; (vii) combining the above two cases to use all 4 
observables - peak counts and power spectrum at z s — 2 and 
z s — 1; and finally (viii) the last combined results (row vii) 
divided by the "z2+Power Spectrum" results (row v). Each 
error quoted is marginalized over the other two parameters, 
and are scaled to a 20,000 deg 2 survey, such as LSST. 



22 



-0.92 
-0.94 
-0.96 - 
-0.98 
-1 
-1.02 
-1.04 
-1.06 



~i 1 1 

z2+PowerSpectmm 
zl+PowerSpectmm 
z2+zl combined 
fiducial 




_l_ 



_l_ 



0.792 0.794 0.796 0.798 0.8 0.802 0.804 

°8 



0.268 
0.266 
0.264 
0.262 
0.26 
0.258 
0.256 



0.254 



"T" 



T" 



z2+PowcrSpectmm 
zl+PowerSpectmm 
z2+zl combined 
fiducial 




0.792 0.794 0.796 0.798 0.8 0.802 0.804 



z2+PowerSpectrum 
zl+PowerSpectrum 
z2+zl combined 
— fiducial 




0.268 
0.266 
0.264 
0.262 
G 0.26 
0.258 
0.256 
0.254 

-1.06 -1.04 -1.02 -1 -0.98 -0.96 -0.94 -0.92 
w 

FIG. 12: 68% percentile error ellipses in two-dimensional pro- 
jections of the 3 -dimensional parameter space of as,, w, and 
Q m - In each panel, we show "tomography" results from noisy, 
unsealed maps for combining peak counts and power spectrum 
using either z s = 2, z s = 1, or their combination. The con- 
straints are scaled to a 20,000 deg 2 survey, such as LSST. 



ref. 1 1 3 1 ) . We compute the marginalized errors from 
Eq. ( 22 ). We use the finite difference between the fiducial 



model and the low(high)-<78, low(high)-w and low(high)- 
Q m models to estimate the backward (forward) deriva- 
tives with respect to these parameters. The average of 
backward and forward derivatives is used to calculate the 
Fisher matrix. 

As mentioned above, we use 15 bins for the peak 
counts. The simple intuitive ad-hoc optimization of the 
bin boundaries, based on avoiding crossings in a single 
pair of cosmologies, which was used in the case of a sin- 
gle parameter, cannot be generalized in a straightforward 
way to the multi-parameter case. Indeed, we have found 



that when we use five bins, the results become sensitive 
to the choice of the cosmology pair over which the bin 
boundaries are optimized. Therefore we use the simpler 
(and unambiguous) scheme of equal-count bins; as shown 
in the previous section, the accuracy in this case conver- 
gences for >15 bins. We emphasize that whenever there 
are significant degeneracies between parameters, the nu- 
merical accuracy requirements on the individual elements 
of the Fisher matrix become more stringent, on the in- 
dividual elements of the Fisher matrix To validate our 
results, we have checked that our marginalized errors do 
converge when we use >15 bins (see Fig. |TTj above) . 

We found that, in addition to the binning, the 
marginalized errors for the peak counts, with a single 
source galaxy redshift, are sensitive to the direction of 
taking the finite-difference derivative (backward or for- 
ward). If we take any one of backward, forward and av- 
eraged derivatives for any of the parameters: as, w, f2 m , 
among the marginalized errors of these 27 combinations, 
the marginalized errors for the three parameters vary by 
about 20% — 25%. On the other hand, the results become 
more stable when we combine the peak counts with the 
power spectrum (reducing the variations to 10% — 15%; 
to be consistent, we also use 15 bins for the power spec- 
trum for computing marginalized errors). This behav- 
ior is consistent with the presence of strong degenera- 
cies between parameters, which are broken when Fisher 
matrices corresponding to two or more observables are 
added (as shown for the combination of cluster counts 
and power spectra [2"9"1 13"5] ) . 

In Table pQV] we show the results from combining peak 
counts and power spectrum in noisy, unsealed maps, us- 
ing either z s = 2, z s = 1, or their combination. All 
the numbers in this table are scaled to the solid angle 
of 20,000 deg 2 , representing an all-sky survey such as 
LSST. We simply divide our results from the 12 deg 2 
maps by a factor \/20, 000/12 (see discussion in Paper I 
for this simple "extrapolation" ) . Comparing the individ- 
ual peak-count (1st row) and power spectrum (4th row) 
errors at z s = 2 in the top half of the table with their 
combination (5th row), we see that the individual errors 
are roughly similar, considering the 20% — 25% variation 
in the peak counts, whereas the combination improves 
on either by a factor of «two. As the table shows, com- 
bining the peak counts and power spectrum at z s = 2 
yield 68% constraints as tight as (Acr 8 , ASl m , Aw) = 
(0.0026,0.0012,0.019). Combining the peak counts and 
power spectrum at z s = 1 gives a constraints worse than 
at z s — 2, with Aa$ and Afi m larger by ~ 30% and Aw 
larger by ~ 60%. This agrees with the previous A^ 2 
results that z s = 2 generally yields a better sensitivity 
than z s = 1. We found that the marginalized errors from 
two redshifts together decrease significantly compared to 
the marginalized errors from z s = 2 alone. For the com- 
bination of the peak counts and the power spectrum, we 
found Acg and Af2 m are ~ 50% of the corresponding er- 
rors from z s = 2, Aw is ~ 75% of the error from z s = 2. 
The large decrease of marginalized error from tomogra- 
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phy is different from the results of A% 2 of peak counts: 
tomography does not significantly improve the sensitiv- 
ity due to the correlation between two redshifts. This 
difference between A\ 2 (showing only raw cosmological 
sensitivity) and marginalized errors (showing also the de- 
generacy between cosmological parameters) clearly shows 
that the change in peak counts induced by each param- 
eter has a different redshift-dependence. To be specific, 
the w-induced chang function of redshift, can- 

not be degenerate with, for example, the Omega-induced 
changes (even if they can be very degenerate at a sin- 
gle redshift, blowing up the marginalized errors). Over- 
all, these constraints are comparable to those expected 
from other forthcoming cosmology probes (see Paper I for 
a discussion), although a fair comparison would involve 
replicating the cosmological parameter set, and other as- 
sumptions made elsewhere, which is beyond the scope of 
this study. 

Our results are also shown graphically in Fig. |12[ which 
show the 68% joint two-parameter constraints (i.e., corre- 
sponding to Ax 2 = 2.3, and marginalized over the third 
parameter). 

V. SUMMARY AND CONCLUSIONS 

In this paper, we used ray-tracing simulations, and 
a halo finder, to study the halo contributions to peaks 
present in convergence maps, expected in large forth- 
coming weak lensing surveys of the sky. This allowed 
us to understand the origin of relatively low-amplitude 
"0.5 — 1.5er", or "medium" peaks, and their sensitivity 
to cosmology. Our motivation to focus on these peaks is 
that they have been shown to drive the overall cosmology- 
sensitivity of the peak counts. Given that weak lensing 
by large scale structure is among the most promising cos- 
mological datasets, expected to be available in the near 
future, the cosmological information content of these ro- 
bustly measurable features must be understood. 

We have found that unlike high peaks, which are typi- 
cally produced and dominated by a single collapsed halo, 
the medium peaks are primarily caused by random noise. 
However, these medium peaks receive an important con- 
tribution from a projection of multiple (typically, 4-8) 
halos along the line of sight, which makes their number 
counts sensitive to cosmological parameters. We have 
shown that for source galaxies at high redshift (z s = 2) 
the presence of noise boosts the distinguishing power from 



peak counts - a counter-intuitive result that we have clar- 
ified analytically. 

Our most important results are that the distribution 
of the medium peaks differ from similar-height peaks in a 
pure Gaussian random field (GRF). We have shown, ex- 
plicitly, that the peaks contain cosmological information 
that differs from that in a GRF, and is non-degenerate 
with the power spectrum of the convergence. We have 
taken the first steps toward more realistic error fore- 
casts, by obtaining the marginalized errors in the three- 
dimensional parameter space of cr$, u>, and Q m . The 
results suggest that peak counts will play a significant 
role in tightening cosmological constraints from forth- 
coming large-solid-angle weak lensing surveys. On the 
other hand, we have not addressed here the long list of 
systematic errors that will ultimately limit the utility of 
convergence peaks. Given that the sensitivity relies heav- 
ily on peaks whose height is close to that of the expected 
noise, these issues will be especially important to address 
in future work. More generally, our results should moti- 
vate investigations to extract yet more cosmological infor- 
mation from nonlinear weak lensing features; Minkowski 
functionals appear to be a promising possibility |36| . 
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